重要前提
安装AI Skills的关键前提是:必须科学上网,且开启TUN模式,这一点至关重要,直接决定安装能否顺利完成,在此郑重提醒三遍:科学上网,科学上网,科学上网。查看完整安装教程 →
systematic-debugging by guanyang/antigravity-skills
npx skills add https://github.com/guanyang/antigravity-skills --skill systematic-debugging随机修复浪费时间并会引入新错误。快速补丁掩盖了根本问题。
核心原则: 在尝试修复之前,必须找到根本原因。症状修复就是失败。
违反此流程的字面要求就是违反调试的精神。
NO FIXES WITHOUT ROOT CAUSE INVESTIGATION FIRST
如果你没有完成第一阶段,就不能提出修复方案。
适用于任何技术问题:
特别在以下情况使用:
不要跳过的情况:
在进入下一阶段之前,必须完成当前阶段。
在尝试任何修复之前:
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
仔细阅读错误信息
稳定复现
检查最近的变更
在多组件系统中收集证据
当系统有多个组件时(CI → 构建 → 签名,API → 服务 → 数据库):
在提出修复方案之前,添加诊断工具:
For EACH component boundary:
- Log what data enters component
- Log what data exits component
- Verify environment/config propagation
- Check state at each layer
Run once to gather evidence showing WHERE it breaks
THEN analyze evidence to identify failing component
THEN investigate that specific component
示例(多层系统):
# Layer 1: Workflow
echo "=== Secrets available in workflow: ==="
echo "IDENTITY: ${IDENTITY:+SET}${IDENTITY:-UNSET}"
# Layer 2: Build script
echo "=== Env vars in build script: ==="
env | grep IDENTITY || echo "IDENTITY not in environment"
# Layer 3: Signing script
echo "=== Keychain state: ==="
security list-keychains
security find-identity -v
# Layer 4: Actual signing
codesign --sign "$IDENTITY" --verbose=4 "$APP"
这揭示了: 哪一层失败了(密钥 → 工作流 ✓,工作流 → 构建 ✗)
当错误位于调用栈深处时:
查看本目录中的 root-cause-tracing.md 以获取完整的向后追踪技术。
快速版本:
* Where does bad value originate?
* What called this with bad value?
* Keep tracing up until you find the source
* Fix at source, not at symptom
在修复之前找到模式:
找到工作示例
与参考实现进行比较
识别差异
理解依赖关系
科学方法:
形成单一假设
最小化测试
在继续之前验证
当你不知道时
修复根本原因,而不是症状:
创建失败的测试用例
superpowers:test-driven-development 技能来编写正确的失败测试实施单一修复
验证修复
如果修复不起作用
如果 3+ 次修复失败:质疑架构
表明存在架构问题的模式:
* Each fix reveals new shared state/coupling/problem in different place
* Fixes require "massive refactoring" to implement
* Each fix creates new symptoms elsewhere
停止并质疑基本原则:
* Is this pattern fundamentally sound?
* Are we "sticking with it through sheer inertia"?
* Should we refactor architecture vs. continue fixing symptoms?
在尝试更多修复之前,与你的真人伙伴讨论
这不是失败的假设 - 这是错误的架构。
如果你发现自己有这些想法:
所有这些都意味着:停止。返回第一阶段。
如果 3+ 次修复失败: 质疑架构(见第四阶段第 5 步)
注意这些纠正信号:
当你看到这些时: 停止。返回第一阶段。
| 借口 | 现实 |
|---|---|
| "问题很简单,不需要流程" | 简单的问题也有根本原因。流程对于简单的错误来说很快。 |
| "紧急情况,没时间走流程" | 系统化调试比猜测-检查式的胡乱尝试更快。 |
| "先试试这个,然后再调查" | 第一次修复设定了模式。从一开始就做对。 |
| "确认修复有效后我会写测试" | 未经测试的修复不牢固。先写测试可以证明它。 |
| "一次做多个修复节省时间" | 无法隔离是什么起了作用。会导致新的错误。 |
| "参考太长,我会调整这个模式" | 部分理解必然导致错误。完整阅读它。 |
| "我看到了问题,让我修复它" | 看到症状 ≠ 理解根本原因。 |
| "再试一次修复"(2+ 次失败后) | 3+ 次失败 = 架构问题。质疑模式,不要再修复。 |
| 阶段 | 关键活动 | 成功标准 |
|---|---|---|
| 1. 根本原因 | 阅读错误、复现、检查变更、收集证据 | 理解是什么和为什么 |
| 2. 模式 | 找到工作示例、比较 | 识别差异 |
| 3. 假设 | 形成理论、最小化测试 | 确认或形成新假设 |
| 4. 实施 | 创建测试、修复、验证 | 错误解决、测试通过 |
如果系统调查揭示问题确实是环境性的、依赖于时间的或外部的:
但是: 95% 的"没有根本原因"案例是调查不完整。
这些技术是系统性调试的一部分,可在本目录中找到:
root-cause-tracing.md - 通过调用栈向后追踪错误以找到原始触发点defense-in-depth.md - 在找到根本原因后,在多个层面添加验证condition-based-waiting.md - 用条件轮询替换任意超时相关技能:
来自调试会话的数据:
每周安装
56
仓库
GitHub 星标
518
首次出现
2026年1月26日
安全审计
安装于
opencode48
codex46
cursor45
github-copilot45
gemini-cli45
kimi-cli44
Random fixes waste time and create new bugs. Quick patches mask underlying issues.
Core principle: ALWAYS find root cause before attempting fixes. Symptom fixes are failure.
Violating the letter of this process is violating the spirit of debugging.
NO FIXES WITHOUT ROOT CAUSE INVESTIGATION FIRST
If you haven't completed Phase 1, you cannot propose fixes.
Use for ANY technical issue:
Use this ESPECIALLY when:
Don't skip when:
You MUST complete each phase before proceeding to the next.
BEFORE attempting ANY fix:
Read Error Messages Carefully
Reproduce Consistently
Check Recent Changes
Gather Evidence in Multi-Component Systems
WHEN system has multiple components (CI → build → signing, API → service → database):
BEFORE proposing fixes, add diagnostic instrumentation:
For EACH component boundary:
- Log what data enters component
- Log what data exits component
- Verify environment/config propagation
- Check state at each layer
Run once to gather evidence showing WHERE it breaks
THEN analyze evidence to identify failing component
THEN investigate that specific component
Example (multi-layer system):
# Layer 1: Workflow
echo "=== Secrets available in workflow: ==="
echo "IDENTITY: ${IDENTITY:+SET}${IDENTITY:-UNSET}"
# Layer 2: Build script
echo "=== Env vars in build script: ==="
env | grep IDENTITY || echo "IDENTITY not in environment"
# Layer 3: Signing script
echo "=== Keychain state: ==="
security list-keychains
security find-identity -v
# Layer 4: Actual signing
codesign --sign "$IDENTITY" --verbose=4 "$APP"
This reveals: Which layer fails (secrets → workflow ✓, workflow → build ✗)
WHEN error is deep in call stack:
See root-cause-tracing.md in this directory for the complete backward tracing technique.
Quick version:
* Where does bad value originate?
* What called this with bad value?
* Keep tracing up until you find the source
* Fix at source, not at symptom
Find the pattern before fixing:
Find Working Examples
Compare Against References
Identify Differences
Understand Dependencies
Scientific method:
Form Single Hypothesis
Test Minimally
Verify Before Continuing
When You Don't Know
Fix the root cause, not the symptom:
Create Failing Test Case
superpowers:test-driven-development skill for writing proper failing testsImplement Single Fix
Verify Fix
If Fix Doesn't Work
If 3+ Fixes Failed: Question Architecture
Pattern indicating architectural problem:
* Each fix reveals new shared state/coupling/problem in different place
* Fixes require "massive refactoring" to implement
* Each fix creates new symptoms elsewhere
STOP and question fundamentals:
* Is this pattern fundamentally sound?
* Are we "sticking with it through sheer inertia"?
* Should we refactor architecture vs. continue fixing symptoms?
Discuss with your human partner before attempting more fixes
This is NOT a failed hypothesis - this is a wrong architecture.
If you catch yourself thinking:
ALL of these mean: STOP. Return to Phase 1.
If 3+ fixes failed: Question the architecture (see Phase 4.5)
Watch for these redirections:
When you see these: STOP. Return to Phase 1.
| Excuse | Reality |
|---|---|
| "Issue is simple, don't need process" | Simple issues have root causes too. Process is fast for simple bugs. |
| "Emergency, no time for process" | Systematic debugging is FASTER than guess-and-check thrashing. |
| "Just try this first, then investigate" | First fix sets the pattern. Do it right from the start. |
| "I'll write test after confirming fix works" | Untested fixes don't stick. Test first proves it. |
| "Multiple fixes at once saves time" | Can't isolate what worked. Causes new bugs. |
| "Reference too long, I'll adapt the pattern" | Partial understanding guarantees bugs. Read it completely. |
| "I see the problem, let me fix it" | Seeing symptoms ≠ understanding root cause. |
| "One more fix attempt" (after 2+ failures) | 3+ failures = architectural problem. Question pattern, don't fix again. |
| Phase | Key Activities | Success Criteria |
|---|---|---|
| 1. Root Cause | Read errors, reproduce, check changes, gather evidence | Understand WHAT and WHY |
| 2. Pattern | Find working examples, compare | Identify differences |
| 3. Hypothesis | Form theory, test minimally | Confirmed or new hypothesis |
| 4. Implementation | Create test, fix, verify | Bug resolved, tests pass |
If systematic investigation reveals issue is truly environmental, timing-dependent, or external:
But: 95% of "no root cause" cases are incomplete investigation.
These techniques are part of systematic debugging and available in this directory:
root-cause-tracing.md - Trace bugs backward through call stack to find original triggerdefense-in-depth.md - Add validation at multiple layers after finding root causecondition-based-waiting.md - Replace arbitrary timeouts with condition pollingRelated skills:
From debugging sessions:
Weekly Installs
56
Repository
GitHub Stars
518
First Seen
Jan 26, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
opencode48
codex46
cursor45
github-copilot45
gemini-cli45
kimi-cli44
Go依赖注入最佳实践:手动注入与库选择指南,提升代码可测试性与松耦合设计
873 周安装