npx skills add https://github.com/hyperb1iss/hyperskills --skill plan基于验证驱动的任务分解,配合 Sibyl 原生追踪功能。从 200 多个真实规划会话中提炼而来 —— 这些是真正经受住代码考验的规划。
核心洞见: 当步骤无法验证时,规划就会失败。不断分解,直到每个步骤都有具体的检查项。在 Sibyl 中追踪,使规划能够跨越上下文窗口的限制。
digraph planning {
rankdir=TB;
node [shape=box];
"1. SCOPE" [style=filled, fillcolor="#e8e8ff"];
"2. EXPLORE" [style=filled, fillcolor="#ffe8e8"];
"3. DECOMPOSE" [style=filled, fillcolor="#e8ffe8"];
"4. VERIFY & APPROVE" [style=filled, fillcolor="#fff8e0"];
"5. TRACK" [style=filled, fillcolor="#e8e8ff"];
"1. SCOPE" -> "2. EXPLORE";
"2. EXPLORE" -> "3. DECOMPOSE";
"3. DECOMPOSE" -> "4. VERIFY & APPROVE";
"4. VERIFY & APPROVE" -> "5. TRACK";
"4. VERIFY & APPROVE" -> "3. DECOMPOSE" [label="gaps found", style=dashed];
}
在探索之前划定工作边界。
在 Sibyl 中搜索相关任务、决策、先前的计划:
sibyl search "<feature keywords>"
sibyl task list -s todo
定义成功标准 —— “完成”是什么样子?
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
识别约束条件:
评估复杂度:
| 规模 | 描述 | 规划深度 |
|---|---|---|
| 快速修复 | < 3 个文件,解决方案明确 | 跳过规划,直接实施 |
| 功能 | 3-10 个文件,已知模式 | 轻量级规划(本技能) |
| 史诗 | 10+ 个文件,新模式 | 完整规划 + 编排 |
| 重构 | 架构变更 | 完整规划 + 前期研究 |
如果这是一个快速修复,请停止规划,直接开始构建。为 5 分钟的修复做规划是浪费。
在分解之前理解代码库的影响范围。
映射影响范围 —— 这会涉及哪些文件/模块?
识别现有模式:
查找依赖关系:
对变更范围的心智模型:
“这涉及:[模块 A](新端点)、[模块 B](类型变更)、[模块 C](测试)。模式遵循[现有功能 X]。依赖于[基础设施 Y]的可用性。”
将工作分解为可验证的步骤。每个步骤都必须有一个检查项。
没有验证方法的步骤不是一个步骤 —— 它只是一个希望。
为每个任务定义:
| 字段 | 描述 |
|---|---|
| 做什么 | 具体的实施行动 |
| 文件 | 要创建/修改的确切文件 |
| 验证 | 如何确认其正常工作 |
| 依赖于 | 哪些任务必须先完成 |
| 方法 | 何时使用 |
|---|---|
typecheck | 类型变更、接口添加 |
test | 逻辑、边界情况、集成 |
lint | 风格、格式化、导入顺序 |
build | 构建系统变更 |
visual | UI 变更(截图或浏览器检查) |
curl/httpie | API 端点变更 |
manual | 仅当没有自动化方法存在时 |
## 任务 [N]: [祈使句标题]
**文件:** `src/path/file.ts`, `tests/path/file.test.ts`
**依赖于:** 任务 [M]
**可并行:** 是/否(可与任务 [X] 同时运行)
### 实施
[2-4 个要点的行动列表]
### 验证
- [ ] `pnpm typecheck` 通过
- [ ] `pnpm test -- file.test.ts` 通过
- [ ] [关于行为的具体断言]
标记可以同时运行的任务以便编排:
第 1 波(基础): 任务 1, 任务 2 [可并行]
第 2 波(核心): 任务 3, 任务 4 [可并行,依赖于第 1 波]
第 3 波(集成): 任务 5 [顺序执行,依赖于第 2 波]
第 4 波(完善): 任务 6, 任务 7 [可并行,依赖于第 3 波]
在执行前审查计划。
在呈现给用户之前,请验证:
以带波次的结构化列表展示计划:
## 计划: [功能名称]
**成功标准:** [可衡量的结果]
**预估任务数:** [N] 个,分布在 [M] 个波次中
**可并行化:** [X]% 的任务可以并行运行
### 第 1 波: 基础
- [ ] 任务 1: [标题] → 验证: [方法]
- [ ] 任务 2: [标题] → 验证: [方法]
### 第 2 波: 核心实现
- [ ] 任务 3: [标题] → 验证: [方法] (依赖于: 1)
- [ ] 任务 4: [标题] → 验证: [方法] (依赖于: 2)
### 第 3 波: 集成
- [ ] 任务 5: [标题] → 验证: [方法] (依赖于: 3, 4)
展示后,明确检查:
在 Sibyl 中注册计划以实现持久化。
为功能创建一个 Sibyl 任务:
sibyl task create --title "[功能]" -d "[成功标准]" --complexity epic
为每个计划步骤创建子任务:
sibyl task create --title "任务 1: [标题]" -e [epic-id] -d "[实施 + 验证]"
记录计划决策:
sibyl add "计划: [功能]" "[N] 个任务分布在 [M] 个波次中。关键决策:[架构选择]。依赖关系:[关键路径]。"
当计划遇到现实情况时会发生改变。当一个任务揭示出意外的复杂性时:
一旦计划获得批准,就将其移交给合适的工具:
| 情况 | 交接对象 |
|---|---|
| 3-5 个简单任务,用户在场 | 直接执行,配合验证关卡 |
| 5-15 个任务,混合并行 | 使用 /hyperskills:orchestrate 配合波次策略 |
| 大型史诗,15+ 个任务 | 使用史诗并行构建策略进行编排 |
| 需要先进行更多研究 | 在执行前使用 /hyperskills:research |
| 阶段 | 审查级别 | 时机 |
|---|---|---|
| 完整仪式 | 实施 + 规格审查 + 代码审查 | 前 3-4 个任务 |
| 标准 | 实施 + 规格审查 | 任务 5-8,模式已稳定 |
| 轻量 | 实施 + 快速验证 | 后期任务,已建立模式 |
这是赢得的信心,而不是偷工减料。
每周安装量
135
仓库
GitHub 星标数
2
首次出现
2026年2月19日
安全审计
安装于
claude-code130
opencode14
gemini-cli14
github-copilot14
codex14
kimi-cli14
Verification-driven task decomposition with Sibyl-native tracking. Mined from 200+ real planning sessions — the plans that actually survived contact with code.
Core insight: Plans fail when steps can't be verified. Decompose until every step has a concrete check. Track in Sibyl so plans survive context windows.
digraph planning {
rankdir=TB;
node [shape=box];
"1. SCOPE" [style=filled, fillcolor="#e8e8ff"];
"2. EXPLORE" [style=filled, fillcolor="#ffe8e8"];
"3. DECOMPOSE" [style=filled, fillcolor="#e8ffe8"];
"4. VERIFY & APPROVE" [style=filled, fillcolor="#fff8e0"];
"5. TRACK" [style=filled, fillcolor="#e8e8ff"];
"1. SCOPE" -> "2. EXPLORE";
"2. EXPLORE" -> "3. DECOMPOSE";
"3. DECOMPOSE" -> "4. VERIFY & APPROVE";
"4. VERIFY & APPROVE" -> "5. TRACK";
"4. VERIFY & APPROVE" -> "3. DECOMPOSE" [label="gaps found", style=dashed];
}
Bound the work before exploring it.
Search Sibyl for related tasks, decisions, prior plans:
sibyl search "<feature keywords>"
sibyl task list -s todo
Define success criteria — what does "done" look like?
Identify constraints:
Classify complexity:
| Scale | Description | Planning Depth |
|---|---|---|
| Quick fix | < 3 files, clear solution | Skip to implementation |
| Feature | 3-10 files, known patterns | Light plan (this skill) |
| Epic | 10+ files, new patterns | Full plan + orchestration |
| Redesign | Architecture change | Full plan + research first |
If this is a quick fix , stop planning and go build. Planning a 5-minute fix is waste.
Understand the codebase surface area before decomposing.
Map the impact surface — which files/modules will this touch?
Identify existing patterns:
Find the dependencies:
A mental model of the change surface:
"This touches: [module A] (new endpoint), [module B] (type changes), [module C] (tests). Pattern follows [existing feature X]. Depends on [infrastructure Y] being available."
Break work into verifiable steps. Every step must have a check.
A step without a verification method is not a step — it's a hope.
For each task, define:
| Field | Description |
|---|---|
| What | Specific implementation action |
| Files | Exact files to create/modify |
| Verify | How to confirm it works |
| Depends on | Which tasks must complete first |
| Method | When to Use |
|---|---|
typecheck | Type changes, interface additions |
test | Logic, edge cases, integrations |
lint | Style, formatting, import order |
build | Build system changes |
visual | UI changes (screenshot or browser check) |
curl/httpie | API endpoint changes |
## Task [N]: [Imperative title]
**Files:** `src/path/file.ts`, `tests/path/file.test.ts`
**Depends on:** Task [M]
**Parallel:** Yes/No (can run alongside Task [X])
### Implementation
[2-4 bullet points of what to do]
### Verify
- [ ] `pnpm typecheck` passes
- [ ] `pnpm test -- file.test.ts` passes
- [ ] [specific assertion about behavior]
Mark tasks that can run simultaneously for orchestration:
Wave 1 (foundation): Task 1, Task 2 [parallel]
Wave 2 (core): Task 3, Task 4 [parallel, depends on Wave 1]
Wave 3 (integration): Task 5 [sequential, depends on Wave 2]
Wave 4 (polish): Task 6, Task 7 [parallel, depends on Wave 3]
Review the plan before executing it.
Before presenting to the user, verify:
Show the plan as a structured list with waves:
## Plan: [Feature Name]
**Success criteria:** [measurable outcome]
**Estimated tasks:** [N] across [M] waves
**Parallelizable:** [X]% of tasks can run in parallel
### Wave 1: Foundation
- [ ] Task 1: [title] → verify: [method]
- [ ] Task 2: [title] → verify: [method]
### Wave 2: Core Implementation
- [ ] Task 3: [title] → verify: [method] (depends: 1)
- [ ] Task 4: [title] → verify: [method] (depends: 2)
### Wave 3: Integration
- [ ] Task 5: [title] → verify: [method] (depends: 3, 4)
After presenting, explicitly check:
Register the plan in Sibyl for persistence.
Create a Sibyl task for the feature:
sibyl task create --title "[Feature]" -d "[success criteria]" --complexity epic
Create sub-tasks for each plan step:
sibyl task create --title "Task 1: [title]" -e [epic-id] -d "[implementation + verify]"
Record the plan decision:
sibyl add "Plan: [feature]" "[N] tasks across [M] waves. Key decisions: [architectural choices]. Dependencies: [critical path]."
Plans change when they meet reality. When a task reveals unexpected complexity:
Once the plan is approved, hand off to the right tool:
| Situation | Handoff |
|---|---|
| 3-5 simple tasks, user present | Execute directly with verification gates |
| 5-15 tasks, mixed parallel | /hyperskills:orchestrate with wave strategy |
| Large epic, 15+ tasks | Orchestrate with Epic Parallel Build strategy |
| Needs more research first | /hyperskills:research before executing |
| Phase | Review Level | When |
|---|---|---|
| Full ceremony | Implement + spec review + code review | First 3-4 tasks |
| Standard | Implement + spec review | Tasks 5-8, patterns stabilized |
| Light | Implement + quick verify | Late tasks, established patterns |
This is earned confidence, not cutting corners.
Weekly Installs
135
Repository
GitHub Stars
2
First Seen
Feb 19, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
claude-code130
opencode14
gemini-cli14
github-copilot14
codex14
kimi-cli14
站立会议模板:敏捷开发每日站会指南与工具(含远程团队异步模板)
10,500 周安装
manual | Only when no automation exists |