code-maturity-assessor by trailofbits/skills
npx skills add https://github.com/trailofbits/skills --skill code-maturity-assessor使用 Trail of Bits 的 9 类别框架系统性地评估代码库成熟度。提供基于证据的评级和可操作的建议。
框架 : Building Secure Contracts - Code Maturity Evaluation v0.1.0
探索代码库以了解:
对于 9 个类别中的每一个,我将:
生成:
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
评级逻辑 :
我评估涵盖代码成熟度所有方面的 9 个综合类别。有关详细标准、分析方法和评级阈值,请参阅 ASSESSMENT_CRITERIA.md。
1. 算术
2. 审计
3. 身份验证 / 访问控制
4. 复杂性管理
5. 去中心化
6. 文档
7. 交易排序风险
8. 低级操作
9. 测试与验证
有关完整的评估标准,包括我将分析的内容、我将询问您的问题以及详细的评级阈值(薄弱/中等/良好/优秀),请参阅 ASSESSMENT_CRITERIA.md。
评估完成后,您将收到一份全面的成熟度报告,包括:
有关完整的示例评估报告,请参阅 EXAMPLE_REPORT.md。
调用时,我将:
| 错误认知 | 为何错误 | 必需行动 |
|---|---|---|
| "发现了一些问题,评估完成" | 评估需要评估所有 9 个类别 | 完成所有 9 个类别的评估,并为每个类别提供证据 |
| "我看到了事件,审计类别看起来不错" | 仅有事件不等于审计成熟度 | 检查日志的全面性、测试、事件响应流程 |
| "代码看起来简单,复杂度低" | 视觉上的简单掩盖了组合复杂度 | 分析圈复杂度、依赖深度、状态机转换 |
| "不是 DeFi 协议,MEV 类别不适用" | MEV 超出了 DeFi 范围(治理、NFT、游戏) | 在声明不适用之前,通过交易排序分析进行验证 |
| "未找到汇编,低级类别不适用" | 低级风险包括外部调用、delegatecall、内联汇编 | 在跳过类别之前,搜索所有低级模式 |
| "这花费太长时间了" | 彻底的评估需要每个类别都花费时间 | 完成所有 9 个类别,询问有关链下流程的澄清性问题 |
| "我可以在没有证据的情况下进行评级" | 没有文件:行号引用的评级 = 无根据的主张 | 为每个类别评估收集具体的代码证据 |
| "用户会知道如何改进" | 模糊的指导 = 没有行动 | 提供按优先级排序的路线图,包含具体的改进措施和工作量估算 |
有关详细的报告结构和模板,请参阅 REPORT_FORMAT.md。
预计时间 : 30-40 分钟
我需要 :
让我们开始评估这个代码库!
每周安装量
1.2K
代码仓库
GitHub 星标数
3.9K
首次出现
Jan 19, 2026
安全审计
安装于
claude-code1.0K
opencode984
gemini-cli963
codex958
cursor941
github-copilot902
Systematically assesses codebase maturity using Trail of Bits' 9-category framework. Provides evidence-based ratings and actionable recommendations.
Framework : Building Secure Contracts - Code Maturity Evaluation v0.1.0
Explores the codebase to understand:
For each of 9 categories, I'll:
Generates:
Rating Logic :
I assess 9 comprehensive categories covering all aspects of code maturity. For detailed criteria, analysis approaches, and rating thresholds, see ASSESSMENT_CRITERIA.md.
1. ARITHMETIC
2. AUDITING
3. AUTHENTICATION / ACCESS CONTROLS
4. COMPLEXITY MANAGEMENT
5. DECENTRALIZATION
6. DOCUMENTATION
7. TRANSACTION ORDERING RISKS
8. LOW-LEVEL MANIPULATION
9. TESTING & VERIFICATION
For complete assessment criteria including what I'll analyze, what I'll ask you, and detailed rating thresholds (WEAK/MODERATE/SATISFACTORY/STRONG), see ASSESSMENT_CRITERIA.md.
When the assessment is complete, you'll receive a comprehensive maturity report including:
For a complete example assessment report, see EXAMPLE_REPORT.md.
When invoked, I will:
Explore codebase
Analyze each category
Interactive assessment
Generate report
| Rationalization | Why It's Wrong | Required Action |
|---|---|---|
| "Found some findings, assessment complete" | Assessment requires evaluating ALL 9 categories | Complete assessment of all 9 categories with evidence for each |
| "I see events, auditing category looks good" | Events alone don't equal auditing maturity | Check logging comprehensiveness, testing, incident response processes |
| "Code looks simple, complexity is low" | Visual simplicity masks composition complexity | Analyze cyclomatic complexity, dependency depth, state machine transitions |
| "Not a DeFi protocol, MEV category doesn't apply" | MEV extends beyond DeFi (governance, NFTs, games) | Verify with transaction ordering analysis before declaring N/A |
| "No assembly found, low-level category is N/A" | Low-level risks include external calls, delegatecall, inline assembly | Search for all low-level patterns before skipping category |
| "This is taking too long" | Thorough assessment requires time per category | Complete all 9 categories, ask clarifying questions about off-chain processes |
| "I can rate this without evidence" | Ratings without file:line references = unsubstantiated claims |
For detailed report structure and templates, see REPORT_FORMAT.md.
Executive Summary
Maturity Scorecard
Detailed Analysis
Improvement Roadmap
Estimated Time : 30-40 minutes
I'll need :
Let's assess this codebase!
Weekly Installs
1.2K
Repository
GitHub Stars
3.9K
First Seen
Jan 19, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
claude-code1.0K
opencode984
gemini-cli963
codex958
cursor941
github-copilot902
| Collect concrete code evidence for every category assessment |
| "User will know what to improve" | Vague guidance = no action | Provide priority-ordered roadmap with specific improvements and effort estimates |