
AI Agent Error Recovery Protocol: Systematically Resolving Six Classes of Development Errors, Including TOOL_FAILURE and BUILD_ERROR

error-recovery by mgd34msu/goodvibes-plugin

74 weekly installs

6 GitHub Stars

GitHub

Install command

npx skills add https://github.com/mgd34msu/goodvibes-plugin --skill error-recovery

Software engineering, automation, debugging

🇨🇳 Chinese introduction

Resources

scripts/
  validate-error-recovery.sh
references/
  common-errors.md

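Error Recovery Protocol

When a task fails, the agent must follow a systematic recovery process that balances efficiency with thoroughness. This skill defines how to categorize errors, leverage institutional memory, apply multi-source recovery strategies, and know when to escalate.

Immediate Response

When an error occurs during task execution:

Do not retry blindly. Read the full error message, stack trace, and any diagnostic output.
Categorize the error as one of six types:
- TOOL_FAILURE -- A precision tool or MCP tool returned an error
- BUILD_ERROR -- A build/compile command failed (npm, tsc, vite, etc.)
- TEST_FAILURE -- A test suite failed (vitest, jest, etc.)
- TYPE_ERROR -- TypeScript type checking failed
- RUNTIME_ERROR -- Code crashed during execution
- EXTERNAL_ERROR -- A third-party service or API failed
Check .goodvibes/memory/failures.json for matching keywords using precision_read:
- Search for keywords from the error message
- If a matching failure record is found, apply the documented resolution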


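Error Categories: Detailed Guidance

TOOL_FAILURE

Common Causes:

Wrong file path (absolute vs relative, typo, file does not exist)
Bad syntax in tool parameters (malformed JSON, incorrect regex)
Sandbox blocking external paths
Missing required parameters
Tool used incorrectly (wrong extract mode, wrong output format)

Recovery Steps:

Re-read the tool's schema and parameter descriptions
Check whether the file/path exists using precision_glob
Verify sandbox settings with precision_config get (check sandbox.enabled)
For precision tools, check whether you are using the right extract/output mode
Try the operation with minimal parameters first, then add complexity step by step
If a precision tool fails repeatedly, determine whether it is a user error or an actual tool bug:
- User error: wrong parameters, bad path, misunderstood tool behavior
- Tool bug: correct parameters, but the tool crashes or returns a wrong result
For user errors, fix the call and retry with precision tools
For tool bugs, fall back to a native tool ONLY for that specific operation, then return to precision tools

Common Patterns from Memory:

Format/mode mismatch: The MCP schema sends output.format, but handlers read output.mode. Always check both with ??.
Ripgrep glob failures: Patterns like src/**/*.ts fail silently with ripgrep. Use a regex to detect literal prefixes and force fast-glob.
Path normalization: Ripgrep returns absolute OR relative paths. Always check path.isAbsolute() before calling path.resolve().
Sandbox opt-in: Enable only when sandbox === true || sandbox === 'true'. Never use === false checks.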

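BUILD_ERROR

Common Causes:

Missing dependencies (not installed, wrong version)
Type errors not caught during editing
Import errors (wrong path, circular dependency)
Configuration errors (tsconfig, vite.config, etc.)
Missing environment variables

Recovery Steps:

Read the full build output (use precision_exec with expect.exit_code to capture stderr)
Identify the first error in the chain (subsequent errors are often cascading)
For missing dependencies: check package.json, run npm install
For type errors: see the TYPE_ERROR section below
For import errors: verify the file exists, check the import path, look for circular dependencies
For configuration errors: compare with working examples in the codebase
Search the first-party docs for the framework/build tool in use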

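TEST_FAILURE

Common Causes:

Wrong test assertions (expected value changed)
Missing mocks or stubs
Changed API contract (function signature, return type)
Test environment not set up correctly
Async timing issues

Recovery Steps:

Read the test failure output (assertion error, expected vs actual)
Identify which test file and which specific test case failed
Read the test file to understand what it is testing
Read the implementation to see whether it matches the test's expectations
Determine whether the test or the implementation is wrong:
- If the implementation changed intentionally -> update the test
- If the implementation broke -> fix the implementation
For async issues, check for a missing await, improper use of done(), or race conditions
For environment issues, check the test setup files (vitest.config, jest.setup.js)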

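TYPE_ERROR

Common Causes:

Unsafe member access (obj.prop where obj could be null/undefined)
Type mismatch in assignments (string assigned to number)
Wrong function call signature (missing parameters, wrong types)
Unsafe any usage escaping the type system
Generic constraints violated

Recovery Steps:

Read the TypeScript error message (it includes the file, line, and specific type issue)
Use the explain_type_error analysis-engine tool if available
For member access: add optional chaining (?.) or a null check
For assignments: fix the type at the source or use a proper type assertion
For function calls: check the function signature and provide correct arguments
For any escapes: replace any with proper types
For generics: ensure type parameters satisfy their constraints

Common Patterns:

Add type guards: if (obj && 'prop' in obj)
Use optional chaining: obj?.prop?.nestedProp
Narrow types with discriminated unions
Use type predicates for custom guards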

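RUNTIME_ERROR

Common Causes:

Null/undefined access (reading a property of null, calling undefined as a function)
Unhandled promise rejections
Uncaught exceptions
Missing environment variables
Network failures (fetch, API calls)
File system errors (ENOENT, EACCES)

Recovery Steps:

Read the stack trace to find the exact line that failed
Identify the root cause (null access, missing environment variable, network issue, etc.)
For null/undefined: add runtime checks or use optional chaining
For promises: add .catch() handlers or wrap await in try/catch
For environment variables: check the .env files and verify they are loaded
For network issues: add retry logic, check credentials, verify the URL
For file system issues: verify the path exists, check permissions

Common Patterns:

Add error boundaries in React components
Use try/catch around all await expressions
Validate environment variables at startup
Add defensive checks: if (!value) throw new Error('...')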

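EXTERNAL_ERROR

Common Causes:

Authentication expired or invalid (401)
Rate limit exceeded (429)
Service down or unreachable (503, ECONNREFUSED)
Invalid API request (400)
Quota exceeded

Recovery Steps:

Check the error status code or message
For auth errors (401): check the credentials in .goodvibes/secrets/, verify the token has not expired
For rate limits (429): implement exponential backoff, check whether the quota can be increased
For service outages (503): retry with backoff, check the service status page
For bad requests (400): read the API docs, verify the request format
For quota issues: check the usage dashboard, request an increase, or wait for the reset

Common Patterns:

Exponential backoff: 1s, 2s, 4s, 8s
Check service status by calling the status endpoint with precision_fetch
Rotate API keys if multiple are available
Cache responses to reduce API calls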

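Recovery Strategy: One-Shot Multi-Source

After categorizing the error and checking failures.json, use a one-shot strategy: consult ALL knowledge sources simultaneously (not sequentially) and apply the best solution:

Internal knowledge -- your training data, codebase patterns (discover, precision_grep, precision_read), GoodVibes memory
First-party docs -- official documentation, API references, changelogs, migration guides
Community knowledge -- Stack Overflow, GitHub Issues, forums
Open internet -- broader web search for edge cases

Applying the Best Solution

Once you have consulted all sources:

Evaluate solutions based on:
- Recency (prefer solutions from similar versions)
- Authority (official docs > community > random blog)
- Specificity (exact error match > general guidance)
- Project fit (matches your stack and patterns)
Apply the solution completely:
- Do not half-apply a fix
- Make all related changes at once
- Use precision_edit for changes, not incremental tweaks
Validate the fix:
- Re-run the failing operation
- Run related tests with precision_exec
- Check whether new errors were introduced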
After Resolution

Once you have successfully resolved the error:

1. Log to `.goodvibes/memory/failures.json`

Use precision_edit to append a new entry:

{
  "id": "fail_YYYYMMDD_HHMMSS",
  "date": "ISO-8601 timestamp",
  "error": "Brief description of the error",
  "context": "What was being attempted when error occurred",
  "root_cause": "Why it happened (technical explanation)",
  "resolution": "How it was fixed (specific steps taken)",
  "prevention": "How to avoid this in the future",
  "keywords": ["relevant", "search", "terms"]
}

{
  "id": "fail_20260215_143000",
  "date": "2026-02-15T14:30:00Z",
  "error": "precision_read returns 'file not found' for valid path",
  "context": "Reading component file at src/components/Button.tsx",
  "root_cause": "Path was relative, but precision tools require absolute paths. Sandbox was enabled, blocking CWD resolution.",
  "resolution": "RESOLVED - Used absolute path with process.cwd() to create absolute path before calling precision_read.",
  "prevention": "Always use absolute paths with precision tools. Use path.resolve() to convert relative to absolute.",
  "keywords": ["precision_read", "path", "absolute", "relative", "sandbox", "file-not-found"]
}

2. Log to `.goodvibes/logs/errors.md`

Append a markdown entry using precision_edit:

## [YYYY-MM-DD HH:MM] ERROR_CATEGORY: Brief Description

**Error**: Full error message

**Context**: What was being done

**Resolution**: How it was fixed

**Time to resolve**: X minutes

---

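3. Continue with the task

Once logged, return to the original task. Do not wait for confirmation.

After Max Attempts (3 attempts)

If you have tried to fix the error 3 times and it is still failing:

1. Log the unresolved failure

Add an entry to failures.json with "resolution": "UNRESOLVED - <what was tried>":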

{
  "id": "fail_20260215_143500",
  "date": "2026-02-15T14:35:00Z",
  "error": "Vitest tests hang indefinitely on CI",
  "context": "Running npm run test in CI pipeline",
  "root_cause": "Unknown - tests pass locally but hang in CI environment",
  "resolution": "UNRESOLVED - Tried: 1) Added --no-watch flag, 2) Increased timeout to 60s, 3) Disabled coverage collection. Tests still hang.",
  "prevention": "Need deeper investigation into CI environment differences",
  "keywords": ["vitest", "ci", "hang", "timeout", "unresolved"]
}

2. Report to orchestrator

Use a structured format in your response:

## Task Status: BLOCKED

### Error
[ERROR_CATEGORY] Brief description

### Attempts Made
1. **Attempt 1**: What was tried -> Result
2. **Attempt 2**: What was tried -> Result
3. **Attempt 3**: What was tried -> Result

### Root Cause Analysis
Best guess at why this is failing based on investigation.

### Suggested Next Steps
- Option 1: [Describe approach that requires different permissions/access]
- Option 2: [Describe alternative architecture/design decision]
- Option 3: [Describe manual intervention needed]

### Files Changed
- `path/to/file.ts` - [what was changed during troubleshooting]

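3. Do NOT mark task as complete

Leave the task in BLOCKED state. The orchestrator will decide whether to:

Escalate to the user
Try a different approach
Assign the task to a different agent with different capabilities
Defer the task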

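Immediate Escalation (Skip Recovery)

For certain error types, do NOT attempt recovery. Escalate immediately to the orchestrator:

1. Permission Errors

EACCES: permission denied
EPERM: operation not permitted

Why: These require user intervention to grant permissions. Agents cannot fix this.

Escalate with: "Permission error encountered. Need user to: [grant file access / run as sudo / change ownership]."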

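2. Missing Credentials/Secrets

Error: OPENAI_API_KEY is not defined
Error: Database connection failed: authentication error
401 Unauthorized

Why: Agents cannot create or modify secrets. Only users can provide credentials.

Escalate with: "Missing credential: [ENV_VAR_NAME or service name]. User needs to provide via .env or secrets management."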

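3. Architectural Ambiguity

Context clues indicate multiple valid approaches:
- Use REST API vs GraphQL
- Use Prisma vs Drizzle
- Component composition pattern unclear

Why: These are design decisions that affect the broader system. Agents should not make architectural choices unilaterally.

Escalate with: "Architectural decision required: [describe the choice]. Options: [list 2-3 options with tradeoffs]."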

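4. Scope Change Discovered

Original task: "Add user profile page"
Discovered: Requires new database schema, auth changes, API endpoints

Why: The task is larger than originally scoped. The orchestrator needs to re-plan.

Escalate with: "Scope expansion discovered. Original: [task]. Required: [new dependencies/changes]. Recommend: [break into subtasks / get user approval]."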

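Summary

When errors occur:

Categorize immediately (TOOL_FAILURE, BUILD_ERROR, etc.)
Check failures.json for known patterns
Use one-shot multi-source recovery (internal + docs + community + web)
Apply the best solution completely
Log the resolution to failures.json and errors.md
Continue with the task

After 3 attempts:

Log as UNRESOLVED
Report to the orchestrator with a structured summary
Do NOT mark the task complete

Escalate immediately for:

Permission errors
Missing credentials
Architectural ambiguity
Scope changes

Remember:

Precision tools are the default, native tools are the fallback
User error != tool failure
Recovery attempts should be systematic, not random
Memory is institutional knowledge -- use it and contribute to it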

🇺🇸English

Resources

scripts/
  validate-error-recovery.sh
references/
  common-errors.md

Error Recovery Protocol

When tasks fail, agents must follow a systematic recovery process that balances efficiency with thoroughness. This skill defines how to categorize errors, leverage institutional memory, apply multi-source recovery strategies, and know when to escalate.

Immediate Response

When an error occurs during task execution:

DO NOT retry blindly. Read the full error message, stack trace, and any diagnostic output.
Categorize the error into one of six types:
- TOOL_FAILURE -- Precision tool or MCP tool returned an error
- BUILD_ERROR -- Build/compile command failed (npm, tsc, vite, etc.)
- TEST_FAILURE -- Test suite failed (vitest, jest, etc.)
- TYPE_ERROR -- TypeScript type checking failed
- RUNTIME_ERROR -- Code crashed during execution
- EXTERNAL_ERROR -- Third-party service or API failure
Check .goodvibes/memory/failures.json for matching keywords using precision_read:
- Search for keywords from the error message
- If a matching failure is found, apply the documented resolution
- If the resolution doesn't work, note it and proceed to recovery phases
- If not found, proceed directly to recovery phases
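
The triage steps above could be sketched roughly as follows. The regex heuristics in `classifyError`, the trimmed `FailureRecord` shape, and the `findKnownFailures` helper are illustrative assumptions rather than part of the skill, and reading the file with precision_read is left out.

```typescript
// Hypothetical triage helpers; the heuristics are illustrative, not the skill's own rules.
type ErrorCategory =
  | "TOOL_FAILURE"
  | "BUILD_ERROR"
  | "TEST_FAILURE"
  | "TYPE_ERROR"
  | "RUNTIME_ERROR"
  | "EXTERNAL_ERROR";

// Subset of the failures.json entry fields that matter for lookup.
interface FailureRecord {
  id: string;
  error: string;
  resolution: string;
  keywords: string[];
}

// Ordered from most to least specific; the first matching pattern wins.
const CATEGORY_PATTERNS: Array<[RegExp, ErrorCategory]> = [
  [/error TS\d+/, "TYPE_ERROR"],
  [/\b(401|429|503)\b|ECONNREFUSED/, "EXTERNAL_ERROR"],
  [/precision_\w+|\bMCP\b/, "TOOL_FAILURE"],
  [/AssertionError|\bexpected\b.*\breceived\b/i, "TEST_FAILURE"],
  [/npm ERR!|build failed/i, "BUILD_ERROR"],
];

function classifyError(message: string): ErrorCategory {
  for (const [pattern, category] of CATEGORY_PATTERNS) {
    if (pattern.test(message)) return category;
  }
  // Assumption: anything unrecognized is treated as a runtime crash.
  return "RUNTIME_ERROR";
}

// Return known failures that share at least one keyword with the error
// message, ordered so the record with the most shared keywords comes first.
function findKnownFailures(records: FailureRecord[], message: string): FailureRecord[] {
  const text = message.toLowerCase();
  const hits = (r: FailureRecord) =>
    r.keywords.filter((k) => text.includes(k.toLowerCase())).length;
  return records
    .filter((r) => hits(r) > 0)
    .sort((a, b) => hits(b) - hits(a));
}
```

An empty result from `findKnownFailures` corresponds to the "not found" branch, where the agent proceeds directly to the recovery phases.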

Error Categories: Detailed Guidance

TOOL_FAILURE

Common Causes:

Wrong file path (absolute vs relative, typo, file doesn't exist)
Bad syntax in tool parameters (malformed JSON, incorrect regex)
Sandbox blocking external paths
Missing required parameters
Tool used incorrectly (wrong extract mode, wrong output format)

Recovery Steps:

Re-read the tool's schema and parameter descriptions
Check if the file/path exists using precision_glob
Verify sandbox settings with precision_config get (check sandbox.enabled)
For precision tools, check if you're using the right extract/output mode
Try the operation with minimal parameters first, then add complexity
If precision tool fails repeatedly, check if it's a user error vs actual tool bug:
- User error: wrong params, bad path, misunderstood tool behavior
- Tool bug: correct params but tool crashes or returns wrong result
For user errors, fix and retry with precision tools
For tool bugs, use native tool fallback ONLY for that specific operation, then return to precision tools

Common Patterns from Memory:

Format/mode mismatch: MCP schema sends output.format, handlers read output.mode. Always check both with ??.
Ripgrep glob failures: Patterns like src/**/*.ts fail silently with ripgrep. Use regex to detect literal prefixes and force fast-glob.
Path normalization: Ripgrep returns absolute OR relative paths. Always use path.isAbsolute() before path.resolve().
Sandbox opt-in: Only enable when sandbox === true || sandbox === 'true'. Never use === false checks.
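
Two of these patterns are easy to get subtly wrong, so a minimal sketch may help. The `OutputOptions` shape, the precedence of `mode` over `format`, and the `"standard"` fallback are assumptions made for illustration; only the `??` check and the opt-in comparison come from the notes above.

```typescript
// Hypothetical option shape; the two field names follow the pattern described above.
interface OutputOptions {
  format?: string; // the name the MCP schema sends
  mode?: string;   // the name handlers read
}

// Check both fields with ?? so a value sent under either name is honored.
// Preferring `mode` and defaulting to "standard" are assumptions of this sketch.
function resolveOutputMode(output: OutputOptions, fallback = "standard"): string {
  return output.mode ?? output.format ?? fallback;
}

// Sandbox is opt-in: only an explicit true (boolean or string) enables it.
// A check written as `sandbox !== false` would wrongly enable it for undefined.
function isSandboxEnabled(sandbox: unknown): boolean {
  return sandbox === true || sandbox === "true";
}
```

Using `??` rather than `||` matters here because it only falls through on null or undefined, so an explicitly provided empty string is not silently replaced.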

BUILD_ERROR

Common Causes:

Missing dependencies (not installed, wrong version)
Type errors not caught during editing
Import errors (wrong path, circular dependency)
Configuration errors (tsconfig, vite.config, etc.)
Environment variable missing

Recovery Steps:

Read the full build output (use precision_exec with expect.exit_code to capture stderr)
Identify the first error in the chain (subsequent errors are often cascading)
For missing deps: Check package.json, run npm install
For type errors: See TYPE_ERROR section below
For import errors: Verify file exists, check import path, look for circular deps
For config errors: Compare with working examples in the codebase
Search first-party docs for the framework/build tool being used
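
Finding the first error in a cascade can be automated for compiler output. The sketch below assumes tsc's plain (non-pretty) diagnostic format, `file(line,col): error TSxxxx: message`; the `CompilerError` type and `firstCompilerError` name are hypothetical, and other build tools would need their own patterns.

```typescript
interface CompilerError {
  file: string;
  line: number;
  column: number;
  code: string;
  message: string;
}

// Matches tsc's plain diagnostic lines, for example:
//   src/app.ts(12,5): error TS2322: Type 'string' is not assignable to type 'number'.
const TSC_LINE = /^(.+?)\((\d+),(\d+)\): error (TS\d+): (.*)$/;

// Return the first diagnostic found in the build output, or null if there is none.
function firstCompilerError(output: string): CompilerError | null {
  for (const line of output.split(/\r?\n/)) {
    const match = TSC_LINE.exec(line.trim());
    if (match) {
      return {
        file: match[1],
        line: Number(match[2]),
        column: Number(match[3]),
        code: match[4],
        message: match[5],
      };
    }
  }
  return null;
}
```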

TEST_FAILURE

Common Causes:

Wrong test assertions (expected value changed)
Missing mocks or stubs
Changed API contract (function signature, return type)
Test environment not set up correctly
Async timing issues

Recovery Steps:

Read the test failure output (assertion error, expected vs actual)
Identify which test file and specific test case failed
Read the test file to understand what it's testing
Read the implementation to see if it matches the test expectations
Determine if the test is wrong or the implementation is wrong:
- If implementation changed intentionally -> update the test
- If implementation broke -> fix the implementation
For async issues, check for missing await, improper use of done(), or race conditions
For environment issues, check test setup files (vitest.config, jest.setup.js)

TYPE_ERROR

Common Causes:

Unsafe member access (obj.prop where obj could be null/undefined)
Type mismatch in assignments (string assigned to number)
Wrong function call signature (missing params, wrong types)
Unsafe any usage escaping type system
Generic constraints violated

Recovery Steps:

Read the TypeScript error message (it includes file, line, and specific type issue)
Use the explain_type_error analysis-engine tool if available
For member access: Add optional chaining (?.) or null check
For assignments: Fix the type at the source or use proper type assertion
For function calls: Check the function signature and provide correct arguments
For any escapes: Replace any with proper types
For generics: Ensure type parameters satisfy constraints

Common Patterns:

Add type guards: if (obj && 'prop' in obj)
Use optional chaining: obj?.prop?.nestedProp
Narrow types with discriminated unions
Use type predicates for custom guards
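
The patterns above might look like this in practice. The `User` and `Shape` types are invented for the example.

```typescript
interface User {
  name: string;
  profile?: { email?: string };
}

// Optional chaining: yields undefined instead of throwing when a link is missing.
function emailOf(user: User | null): string | undefined {
  return user?.profile?.email;
}

// Discriminated union: the `kind` field lets the compiler narrow each branch.
type Shape =
  | { kind: "circle"; radius: number }
  | { kind: "rect"; width: number; height: number };

function area(shape: Shape): number {
  switch (shape.kind) {
    case "circle":
      return Math.PI * shape.radius * shape.radius;
    case "rect":
      return shape.width * shape.height;
  }
}

// Type predicate: a custom guard that narrows `unknown` to `User`.
function isUser(value: unknown): value is User {
  return (
    typeof value === "object" &&
    value !== null &&
    "name" in value &&
    typeof (value as { name: unknown }).name === "string"
  );
}
```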

RUNTIME_ERROR

Common Causes:

Null/undefined access (property of null, calling undefined as function)
Unhandled promise rejections
Uncaught exceptions
Missing environment variables
Network failures (fetch, API calls)
File system errors (ENOENT, EACCES)

Recovery Steps:

Read the stack trace to find the exact line that failed
Identify the root cause (null access, missing env var, network, etc.)
For null/undefined: Add runtime checks or use optional chaining
For promises: Add .catch() handlers or try/catch around await
For env vars: Check .env files, verify they're loaded
For network: Add retry logic, check credentials, verify URL
For file system: Verify paths exist, check permissions

Common Patterns:

Add error boundaries in React components
Use try/catch around all await expressions
Validate env vars at startup
Add defensive checks: if (!value) throw new Error('...')
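
As a rough illustration of startup validation and guarded awaits, the sketch below takes the environment as a parameter (in Node this would typically be `process.env`) so it stays testable. The helper names `requireEnv` and `safely` are hypothetical.

```typescript
// Validate required variables once at startup and fail with one clear message.
function requireEnv(
  env: Record<string, string | undefined>,
  names: string[],
): Record<string, string> {
  // Unset and empty-string values are both treated as missing.
  const missing = names.filter((name) => !env[name]);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
  const result: Record<string, string> = {};
  for (const name of names) result[name] = env[name] as string;
  return result;
}

type Result<T> = { ok: true; value: T } | { ok: false; error: Error };

// Wrap an awaited call so a rejection becomes a handled value, not a crash.
async function safely<T>(work: () => Promise<T>): Promise<Result<T>> {
  try {
    return { ok: true, value: await work() };
  } catch (err) {
    return { ok: false, error: err instanceof Error ? err : new Error(String(err)) };
  }
}
```

Reporting every missing variable in a single error avoids the fix-one-rerun-find-the-next loop that a per-variable check tends to produce.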

EXTERNAL_ERROR

Common Causes:

Authentication expired or invalid (401)
Rate limit exceeded (429)
Service down or unreachable (503, ECONNREFUSED)
Invalid API request (400)
Quota exceeded

Recovery Steps:

Check the error status code or message
For auth errors (401): Check credentials in .goodvibes/secrets/, verify token hasn't expired
For rate limits (429): Implement exponential backoff, check if quota can be increased
For service down (503): Retry with backoff, check service status page
For bad requests (400): Read API docs, verify request format
For quota: Check usage dashboard, request increase, or wait for reset

Common Patterns:

Exponential backoff: 1s, 2s, 4s, 8s
Check service status via precision_fetch to status endpoint
Rotate API keys if multiple are available
Cache responses to reduce API calls
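
The 1s, 2s, 4s, 8s schedule can be expressed as a small retry wrapper. This is a sketch under assumptions: the `isRetryable` hook, the injectable `sleep`, and the default of four retries are choices made here, not requirements of the skill.

```typescript
// Delays double from a base: 1000, 2000, 4000, 8000 ms for four retries.
function backoffDelays(retries: number, baseMs = 1000): number[] {
  return Array.from({ length: retries }, (_, i) => baseMs * Math.pow(2, i));
}

const defaultSleep = (ms: number) =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

// Retry `call` with exponential backoff. The last error is rethrown when the
// retries run out or when `isRetryable` says the failure is not worth retrying.
async function withBackoff<T>(
  call: () => Promise<T>,
  isRetryable: (err: unknown) => boolean,
  retries = 4,
  sleep: (ms: number) => Promise<void> = defaultSleep,
): Promise<T> {
  const delays = backoffDelays(retries);
  for (let attempt = 0; ; attempt++) {
    try {
      return await call();
    } catch (err) {
      if (attempt >= delays.length || !isRetryable(err)) throw err;
      await sleep(delays[attempt]);
    }
  }
}
```

Passing `sleep` as a parameter lets tests substitute a no-op so the schedule can be checked without real waiting.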

Recovery Strategy: One-Shot Multi-Source

After categorizing the error and checking failures.json, use a one-shot strategy where you consult ALL knowledge sources simultaneously (not sequentially) and apply the best solution:

Internal knowledge -- your training data, codebase patterns (discover, precision_grep, precision_read), GoodVibes memory
First-party docs -- official documentation, API references, changelogs, migration guides
Community knowledge -- Stack Overflow, GitHub Issues, forums
Open internet -- broader web search for edge cases
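
Consulting sources simultaneously rather than sequentially maps naturally onto `Promise.all`. In the sketch below, `Source`, `Finding`, and `consultAll` are hypothetical names; real lookups would wrap tools such as precision_grep or a web search.

```typescript
// Hypothetical shape of one result from a knowledge source.
interface Finding {
  source: string;
  summary: string;
}

interface Source {
  name: string;
  lookup: (error: string) => Promise<Finding[]>;
}

// Start every lookup at once and wait for all of them. A source whose lookup
// rejects contributes no findings instead of aborting the whole consultation.
async function consultAll(sources: Source[], error: string): Promise<Finding[]> {
  const results = await Promise.all(
    sources.map((s) => s.lookup(error).catch((): Finding[] => [])),
  );
  return ([] as Finding[]).concat(...results);
}
```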

Applying the Best Solution

Once you've consulted all sources:

Evaluate solutions based on:
- Recency (prefer solutions from similar versions)
- Authority (official docs > community > random blog)
- Specificity (exact error match > general guidance)
- Project fit (matches your stack and patterns)
Apply the solution completely:
- Don't half-apply a fix
- Make all related changes at once
- Use precision_edit for changes, not incremental tweaks
Validate the fix:
- Re-run the failing operation
- Run related tests with precision_exec
- Check for new errors introduced
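
One way to make the evaluation criteria concrete is to rate each candidate fix on the four axes and compare totals. The 0-to-1 ratings and the equal weighting are assumptions of this sketch; the protocol itself does not prescribe a formula.

```typescript
// Hypothetical rating of a candidate fix on the four criteria, each from 0 to 1.
interface Candidate {
  description: string;
  recency: number;
  authority: number;
  specificity: number;
  projectFit: number;
}

// Equal weights are an assumption; a project may reasonably weight authority higher.
function score(c: Candidate): number {
  return (c.recency + c.authority + c.specificity + c.projectFit) / 4;
}

// Pick the highest-scoring candidate, or undefined for an empty list.
// The input array is copied so the caller's ordering is left untouched.
function bestCandidate(candidates: Candidate[]): Candidate | undefined {
  return [...candidates].sort((a, b) => score(b) - score(a))[0];
}
```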

After Resolution

Once you've successfully resolved the error:

1. Log to `.goodvibes/memory/failures.json`

Use precision_edit to append a new entry with:

{
  "id": "fail_YYYYMMDD_HHMMSS",
  "date": "ISO-8601 timestamp",
  "error": "Brief description of the error",
  "context": "What was being attempted when error occurred",
  "root_cause": "Why it happened (technical explanation)",
  "resolution": "How it was fixed (specific steps taken)",
  "prevention": "How to avoid this in the future",
  "keywords": ["relevant", "search", "terms"]
}

Example:

{
  "id": "fail_20260215_143000",
  "date": "2026-02-15T14:30:00Z",
  "error": "precision_read returns 'file not found' for valid path",
  "context": "Reading component file at src/components/Button.tsx",
  "root_cause": "Path was relative, but precision tools require absolute paths. Sandbox was enabled, blocking CWD resolution.",
  "resolution": "RESOLVED - Used absolute path with process.cwd() to create absolute path before calling precision_read.",
  "prevention": "Always use absolute paths with precision tools. Use path.resolve() to convert relative to absolute.",
  "keywords": ["precision_read", "path", "absolute", "relative", "sandbox", "file-not-found"]
}
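
A typed helper can keep new entries consistent with this schema. The sketch assumes that failures.json holds a top-level array and that ids are derived from the UTC timestamp (which matches the example, where the id agrees with the "Z" date); neither is stated outright by the protocol.

```typescript
interface FailureEntry {
  id: string;
  date: string;
  error: string;
  context: string;
  root_cause: string;
  resolution: string;
  prevention: string;
  keywords: string[];
}

const pad2 = (n: number) => (n < 10 ? "0" : "") + n;

// Build an id of the form fail_YYYYMMDD_HHMMSS from the UTC fields of `when`.
function makeFailureId(when: Date): string {
  const day = `${when.getUTCFullYear()}${pad2(when.getUTCMonth() + 1)}${pad2(when.getUTCDate())}`;
  const time = `${pad2(when.getUTCHours())}${pad2(when.getUTCMinutes())}${pad2(when.getUTCSeconds())}`;
  return `fail_${day}_${time}`;
}

// Return a new array with the entry appended; the caller decides when to write it back.
// Note that toISOString() includes milliseconds, which is still valid ISO-8601.
function appendFailure(
  existing: FailureEntry[],
  details: Omit<FailureEntry, "id" | "date">,
  when: Date = new Date(),
): FailureEntry[] {
  const entry: FailureEntry = {
    id: makeFailureId(when),
    date: when.toISOString(),
    ...details,
  };
  return [...existing, entry];
}
```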

2. Log to `.goodvibes/logs/errors.md`

Append a markdown entry using precision_edit:

## [YYYY-MM-DD HH:MM] ERROR_CATEGORY: Brief Description

**Error**: Full error message

**Context**: What was being done

**Resolution**: How it was fixed

**Time to resolve**: X minutes

---

3. Continue with the task

Once logged, return to the original task. Don't wait for confirmation.

After Max Attempts (3 attempts)

If you've tried to fix the error 3 times and it's still failing:

1. Log the unresolved failure

Add to failures.json with "resolution": "UNRESOLVED - <what was tried>":

{
  "id": "fail_20260215_143500",
  "date": "2026-02-15T14:35:00Z",
  "error": "Vitest tests hang indefinitely on CI",
  "context": "Running npm run test in CI pipeline",
  "root_cause": "Unknown - tests pass locally but hang in CI environment",
  "resolution": "UNRESOLVED - Tried: 1) Added --no-watch flag, 2) Increased timeout to 60s, 3) Disabled coverage collection. Tests still hang.",
  "prevention": "Need deeper investigation into CI environment differences",
  "keywords": ["vitest", "ci", "hang", "timeout", "unresolved"]
}

2. Report to orchestrator

Use structured format in your response:

## Task Status: BLOCKED

### Error
[ERROR_CATEGORY] Brief description

### Attempts Made
1. **Attempt 1**: What was tried -> Result
2. **Attempt 2**: What was tried -> Result
3. **Attempt 3**: What was tried -> Result

### Root Cause Analysis
Best guess at why this is failing based on investigation.

### Suggested Next Steps
- Option 1: [Describe approach that requires different permissions/access]
- Option 2: [Describe alternative architecture/design decision]
- Option 3: [Describe manual intervention needed]

### Files Changed
- `path/to/file.ts` - [what was changed during troubleshooting]
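
The three-attempt limit and the "Attempts Made" section of the report can be tied together in code. In this sketch, each fix is modeled as a synchronous function returning true on success; that model, and the names `tryFixes` and `formatAttempts`, are simplifying assumptions.

```typescript
interface Attempt {
  description: string;
  result: string;
}

interface Outcome {
  status: "RESOLVED" | "BLOCKED";
  attempts: Attempt[];
}

const MAX_ATTEMPTS = 3;

// Apply fixes in order. Stop at the first success or after MAX_ATTEMPTS tries,
// whichever comes first; fixes beyond the limit are never run.
function tryFixes(fixes: Array<{ description: string; apply: () => boolean }>): Outcome {
  const attempts: Attempt[] = [];
  for (const fix of fixes.slice(0, MAX_ATTEMPTS)) {
    const ok = fix.apply();
    attempts.push({ description: fix.description, result: ok ? "fixed" : "still failing" });
    if (ok) return { status: "RESOLVED", attempts };
  }
  return { status: "BLOCKED", attempts };
}

// Render the "Attempts Made" list in the report format shown above.
function formatAttempts(attempts: Attempt[]): string {
  return attempts
    .map((a, i) => `${i + 1}. **Attempt ${i + 1}**: ${a.description} -> ${a.result}`)
    .join("\n");
}
```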

3. Do NOT mark task as complete

Leave the task in BLOCKED state. The orchestrator will decide whether to:

Escalate to user
Try a different approach
Assign to a different agent with different capabilities
Defer the task

Immediate Escalation (Skip Recovery)

For certain error types, do NOT attempt recovery. Escalate immediately to the orchestrator:

1. Permission Errors

EACCES: permission denied
EPERM: operation not permitted

Why: These require user intervention to grant permissions. Agents can't fix this.

Escalate with: "Permission error encountered. Need user to: [grant file access / run as sudo / change ownership]."

2. Missing Credentials/Secrets

Error: OPENAI_API_KEY is not defined
Error: Database connection failed: authentication error
401 Unauthorized

Why: Agents can't create or modify secrets. Only users can provide credentials.

Escalate with: "Missing credential: [ENV_VAR_NAME or service name]. User needs to provide via .env or secrets management."

3. Architectural Ambiguity

Context clues indicate multiple valid approaches:
- Use REST API vs GraphQL
- Use Prisma vs Drizzle
- Component composition pattern unclear

Why: These are design decisions that affect the broader system. Agents shouldn't make architectural choices unilaterally.

Escalate with: "Architectural decision required: [describe the choice]. Options: [list 2-3 options with tradeoffs]."

4. Scope Change Discovered

Original task: "Add user profile page"
Discovered: Requires new database schema, auth changes, API endpoints

Why: The task is larger than originally scoped. Orchestrator needs to re-plan.

Escalate with: "Scope expansion discovered. Original: [task]. Required: [new dependencies/changes]. Recommend: [break into subtasks / get user approval]."
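
The first two escalation triggers have recognizable error text, so a pre-check before entering recovery is feasible. The patterns below are drawn only from the example messages in this section and are certainly incomplete; `immediateEscalation` is a hypothetical helper name.

```typescript
type EscalationReason = "PERMISSION" | "MISSING_CREDENTIAL";

// Patterns taken from the example messages above; a real agent would likely need more.
const ESCALATION_PATTERNS: Array<[RegExp, EscalationReason]> = [
  [/\b(EACCES|EPERM)\b/, "PERMISSION"],
  [/\b401\b|authentication error|_API_KEY is not defined/i, "MISSING_CREDENTIAL"],
];

// Return the reason to skip recovery, or null when normal recovery applies.
// Architectural ambiguity and scope changes need judgment rather than string
// matching, so they are deliberately not detected here.
function immediateEscalation(message: string): EscalationReason | null {
  for (const [pattern, reason] of ESCALATION_PATTERNS) {
    if (pattern.test(message)) return reason;
  }
  return null;
}
```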

Summary

When errors occur:

Categorize immediately (TOOL_FAILURE, BUILD_ERROR, etc.)
Check failures.json for known patterns
Use one-shot multi-source recovery (internal + docs + community + web)
Apply the best solution completely
Log resolution to failures.json and errors.md
Continue with task

After 3 attempts:

Log as UNRESOLVED
Report to orchestrator with structured summary
Do NOT mark task complete

Escalate immediately for:

Permission errors
Missing credentials
Architectural ambiguity
Scope changes

Remember:

Precision tools are the default, native tools are the fallback
User error != tool failure
Recovery attempts should be systematic, not random
Memory is institutional knowledge -- use it and contribute to it

Weekly Installs

Repository

mgd34msu/goodvi…s-plugin

GitHub Stars

First Seen

Feb 17, 2026

Security Audits

Gen Agent Trust Hub: Pass
Socket: Pass
Snyk: Warn

Installed on

codex: 70
opencode: 70
github-copilot: 69
kimi-cli: 69
gemini-cli: 69
amp: 69
