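Verification principle for software development: verify before claiming completion, make no false claims | Code quality and testing standards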

verification-before-completion by oimiragieo/agent-studio

59 weekly installs

24 GitHub Stars

Install command

npx skills add https://github.com/oimiragieo/agent-studio --skill verification-before-completion

Software engineering, code quality, testing

Verification Before Completion

Overview

Claiming work is complete without verification is dishonesty, not efficiency.

Core principle: Evidence before claims, always.

Violating the letter of this rule is violating the spirit of this rule.

The Iron Law

NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCE

If you haven't run the verification command in this message, you cannot claim it passes.

The Gate Function

BEFORE claiming any status or expressing satisfaction:

1. IDENTIFY: What command proves this claim?
2. RUN: Execute the FULL command (fresh, complete)
3. READ: Full output, check exit code, count failures
4. VERIFY: Does output confirm the claim?
   - If NO: State actual status with evidence
   - If YES: State claim WITH evidence
5. ONLY THEN: Make the claim

Skip any step = lying, not verifying
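The gate can be sketched as a small Python helper. This is a minimal illustration, not part of the skill itself: the names `Evidence`, `run_gate`, and `state_claim` are hypothetical, and the demo command is a stand-in for a real test or build command. The sketch only checks the exit code automatically; reading the full output and counting failures is still left to whoever makes the claim.

```python
import subprocess
import sys
from dataclasses import dataclass


@dataclass
class Evidence:
    command: list[str]
    exit_code: int
    output: str  # full stdout + stderr, not just the last line


def run_gate(command: list[str]) -> Evidence:
    """Steps 2-3: run the full command fresh and keep everything it printed."""
    result = subprocess.run(command, capture_output=True, text=True)
    return Evidence(command, result.returncode, result.stdout + result.stderr)


def state_claim(claim: str, evidence: Evidence) -> str:
    """Steps 4-5: word the report from the evidence, never ahead of it."""
    cmd = " ".join(evidence.command)
    if evidence.exit_code == 0:
        return f"{claim} (ran `{cmd}`, exit 0)"
    return f"NOT VERIFIED: {claim} (ran `{cmd}`, exit {evidence.exit_code})"


if __name__ == "__main__":
    # Stand-in command (assumption); substitute the project's real command.
    evidence = run_gate([sys.executable, "-c", "print('34/34 pass')"])
    print(evidence.output, end="")
    print(state_claim("All tests pass", evidence))
```

Keeping the command and its output together in one record means the claim can always be quoted alongside the evidence that backs it.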

Common Failures

| Claim | Requires | Not Sufficient |
| --- | --- | --- |
| Tests pass | Test command output: 0 failures | Previous run, "should pass" |
| Linter clean | `pnpm lint:fix` output: 0 errors | Partial check, extrapolation |
| Format clean | `pnpm format` output: no changes | Visual inspection, assumption |
| Build succeeds | Build command: exit 0 | Linter passing, logs look good |
| Bug fixed | Test original symptom: passes | Code changed, assumed fixed |
| Regression test works | Red-green cycle verified | Test passes once |
| Agent completed | VCS diff shows changes | Agent reports "success" |
| Requirements met | Line-by-line checklist | Tests passing |
| Code quality gates | `pnpm lint:fix` + `pnpm format` passed | Tests passing |

Red Flags - STOP

- Using "should", "probably", "seems to"
- Expressing satisfaction before verification ("Great!", "Perfect!", "Done!", etc.)
- About to commit, push, or open a PR without verification
- Trusting agent success reports
- Relying on partial verification
- Thinking "just this once"
- Feeling tired and wanting the work to be over
- ANY wording implying success without having run verification
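Some of these red flags are literal phrases, so a draft status message can be scanned for them before it is sent. The sketch below is a hypothetical tripwire (`find_red_flags` and both word lists are assumptions built from the list above). A keyword scan cannot catch paraphrases or implied success, so it complements running the verification and never replaces it.

```python
import re

# Phrases taken from the red-flag list; extend as needed.
HEDGES = ["should", "probably", "seems to"]
PREMATURE = ["great!", "perfect!", "done!"]


def find_red_flags(draft: str) -> list[str]:
    """Return the red-flag phrases found in a draft status message."""
    lowered = draft.lower()
    found = [h for h in HEDGES if re.search(rf"\b{re.escape(h)}\b", lowered)]
    found += [p for p in PREMATURE if p in lowered]
    return found


print(find_red_flags("Great! The tests should pass now."))
# prints ['should', 'great!']
```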

Rationalization Prevention

| Excuse | Reality |
| --- | --- |
| "Should work now" | RUN the verification |
| "I'm confident" | Confidence is not evidence |
| "Just this once" | No exceptions |
| "Linter passed" | A linter is not a compiler |
| "Agent said success" | Verify independently |
| "I'm tired" | Exhaustion is not an excuse |
| "Partial check is enough" | Partial proves nothing |
| "Different words so rule doesn't apply" | Spirit over letter |

Key Patterns

Tests:

CORRECT: [Run test command] [See: 34/34 pass] "All tests pass"
WRONG: "Should pass now" / "Looks correct"

Regression tests (TDD Red-Green):

CORRECT: Write -> Run (pass) -> Revert fix -> Run (MUST FAIL) -> Restore -> Run (pass)
WRONG: "I've written a regression test" (without red-green verification)
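The red-green sequence above can be sketched as a function that refuses to accept a regression test unless it fails with the fix reverted. Everything here is illustrative: `verify_red_green` and its three callables are hypothetical names, and the toy `state` flag stands in for reverting and restoring a real change (for example through the VCS).

```python
from typing import Callable


def verify_red_green(
    run_test: Callable[[], bool],
    revert_fix: Callable[[], None],
    restore_fix: Callable[[], None],
) -> bool:
    """Return True only if the test passes with the fix and fails without it."""
    if not run_test():  # Run (pass): must be green with the fix in place
        return False
    revert_fix()
    try:
        failed_without_fix = not run_test()  # Run (MUST FAIL)
    finally:
        restore_fix()  # always put the fix back
    return failed_without_fix and run_test()  # Run (pass) after restoring


# Toy stand-in for a codebase: one flag says whether the fix is applied.
state = {"fixed": True}


def buggy_or_fixed_abs(x: int) -> int:
    return abs(x) if state["fixed"] else x


def regression_test() -> bool:
    return buggy_or_fixed_abs(-2) == 2


ok = verify_red_green(
    regression_test,
    revert_fix=lambda: state.update(fixed=False),
    restore_fix=lambda: state.update(fixed=True),
)
print(ok)  # prints True
```

A test that always passes, such as `lambda: True`, makes `verify_red_green` return `False`, which is exactly the case the red step exists to catch.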

Build:

CORRECT: [Run build] [See: exit 0] "Build passes"
WRONG: "Linter passed" (linter doesn't check compilation)

Lint and Format (BLOCKING GATE):

CORRECT: [Run pnpm lint:fix] [See: 0 errors] [Run pnpm format] [See: no changes] "Lint and format clean"
WRONG: "Code looks formatted" / "No obvious lint issues" / "Should be clean"
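Because lint and format are blocking gates, it can help to run every gate in one pass and report each exit code rather than stopping at the first success. A minimal sketch follows, with hypothetical helper names (`run_all_gates`, `all_clean`). The demo uses stand-in commands so it runs anywhere; in a pnpm project the entries would presumably be `["pnpm", "lint:fix"]` and `["pnpm", "format"]`.

```python
import subprocess
import sys


def run_all_gates(gates: dict[str, list[str]]) -> dict[str, int]:
    """Run every gate, even after a failure, and return each exit code."""
    results = {}
    for name, command in gates.items():
        completed = subprocess.run(command, capture_output=True, text=True)
        print(f"--- {name}: exit {completed.returncode}")
        print(completed.stdout + completed.stderr, end="")  # full output
        results[name] = completed.returncode
    return results


def all_clean(results: dict[str, int]) -> bool:
    """No gates run counts as not clean: there is no evidence to cite."""
    return bool(results) and all(code == 0 for code in results.values())


if __name__ == "__main__":
    # Stand-in commands (assumption), not the project's real gates.
    demo = {
        "lint": [sys.executable, "-c", "print('0 errors')"],
        "format": [sys.executable, "-c", "print('no changes')"],
    }
    results = run_all_gates(demo)
    print("Lint and format clean" if all_clean(results) else "Gates failing")
```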

Requirements:

CORRECT: Re-read plan -> Create checklist -> Verify each -> Report gaps or completion
WRONG: "Tests pass, phase complete"
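One way to make the checklist step concrete is to pair each requirement with the evidence that verified it and report anything left without evidence. This is only a sketch: the `Requirement` class, the `report` function, and the plan items are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Requirement:
    text: str
    evidence: str = ""  # command output or other proof; empty means unverified


def report(requirements: list[Requirement]) -> str:
    """Report gaps if any requirement lacks evidence, otherwise completion."""
    gaps = [r.text for r in requirements if not r.evidence.strip()]
    if gaps:
        return "INCOMPLETE, unverified: " + "; ".join(gaps)
    return f"All {len(requirements)} requirements verified"


# Hypothetical plan items, for illustration only.
plan = [
    Requirement("Endpoint returns 404 for unknown id", "test_unknown_id passed"),
    Requirement("Error is logged"),
]
print(report(plan))  # prints INCOMPLETE, unverified: Error is logged
```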

Agent delegation:

CORRECT: Agent reports success -> Check VCS diff -> Verify changes -> Report actual state
WRONG: Trust agent report
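Checking the VCS diff can be as simple as asking git what changed instead of relying on the agent's summary. The sketch below assumes a git repository and uses `git status --porcelain`. The helper names and the sample paths are hypothetical, and paths that git quotes (for example those containing spaces) or rename entries are not handled.

```python
import subprocess


def parse_porcelain(status_output: str) -> list[tuple[str, str]]:
    """Split `git status --porcelain` lines into (status, path) pairs."""
    changes = []
    for line in status_output.splitlines():
        if len(line) > 3:
            # Format: two status characters, a space, then the path.
            changes.append((line[:2].strip(), line[3:]))
    return changes


def changed_files(repo_dir: str = ".") -> list[tuple[str, str]]:
    """Ask git what actually changed, regardless of what the agent reported."""
    result = subprocess.run(
        ["git", "status", "--porcelain"],
        cwd=repo_dir, capture_output=True, text=True, check=True,
    )
    return parse_porcelain(result.stdout)


# Sample output with made-up paths, parsed without needing a repository.
sample = " M src/app.ts\n?? src/new_helper.ts\n"
print(parse_porcelain(sample))
# prints [('M', 'src/app.ts'), ('??', 'src/new_helper.ts')]
```

An empty list after an agent reports success is itself evidence: nothing was changed, whatever the report said.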

Why This Matters

From failure analysis:

- "I don't believe you" - trust broken
- Undefined functions shipped - would crash
- Missing requirements shipped - incomplete features
- Time wasted on false completion -> redirect -> rework
- Violates: "Honesty is a core value. If you lie, you'll be replaced."

When To Apply

ALWAYS before:

- ANY variation of success/completion claims
- ANY expression of satisfaction
- ANY positive statement about work state
- Committing, PR creation, task completion
- Moving to next task
- Delegating to agents

Rule applies to:

- Exact phrases
- Paraphrases and synonyms
- Implications of success
- ANY communication suggesting completion/correctness

The Bottom Line

No shortcuts for verification.

Run the command. Read the output. THEN claim the result.

This is non-negotiable.

Iron Laws

- NEVER claim task completion without running fresh verification commands in the current session
- ALWAYS read the full command output before asserting a result, not just the last line or exit code
- NEVER use hedging language ("should pass", "probably works") as a substitute for running verification
- ALWAYS apply the red-green-refactor cycle: verify the test fails without the fix, passes with the fix, and fails again when the fix is reverted
- NEVER commit, push, or close a task without verified evidence that all gates (tests, lint, format) pass

Anti-Patterns

| Anti-Pattern | Why It Fails | Correct Approach |
| --- | --- | --- |
| Claiming success before running commands | No evidence the claim is true | Run the command, read full output, then claim |
| Trusting results from a prior run | State may have changed since last execution | Always run fresh verification in the current session |
| Partial verification (tests but not lint) | Lint failures fail CI even when tests pass | Run all gates: tests, `pnpm lint:fix`, `pnpm format` |
| Using "should" or "probably" language | Implies assumption, not verification | Eliminate hedging; verify then state the fact |
| Skipping red-green-refactor cycle | Regression tests that always pass catch nothing | Verify test fails on revert before marking complete |

Memory Protocol (MANDATORY)

Before starting: Read .claude/context/memory/learnings.md

After completing:

- New pattern -> .claude/context/memory/learnings.md
- Issue found -> .claude/context/memory/issues.md
- Decision made -> .claude/context/memory/decisions.md

ASSUME INTERRUPTION: If it's not in memory, it didn't happen.
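Recording to memory can be sketched as an append-only helper that routes each note to the matching file. The file paths come from the protocol above. The `record` function, the `kind` keys, and the dated-bullet entry format are assumptions made for illustration, since the skill does not specify an entry format.

```python
from datetime import date
from pathlib import Path

MEMORY_DIR = Path(".claude/context/memory")
FILES = {
    "pattern": "learnings.md",
    "issue": "issues.md",
    "decision": "decisions.md",
}


def record(kind: str, note: str, memory_dir: Path = MEMORY_DIR) -> Path:
    """Append one dated bullet to the memory file for this kind of note."""
    target = memory_dir / FILES[kind]
    target.parent.mkdir(parents=True, exist_ok=True)
    # Append mode, so earlier entries are never overwritten.
    with target.open("a", encoding="utf-8") as handle:
        handle.write(f"- {date.today().isoformat()}: {note}\n")
    return target
```

For example, `record("issue", "lint gate skipped before commit")` would append a line to `.claude/context/memory/issues.md`. Writing each entry as soon as it is known, rather than batching at the end, fits the "assume interruption" rule.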

Repository: oimiragieo/agent-studio

First Seen: Jan 27, 2026

Security Audits: Gen Agent Trust Hub (Pass), Socket (Pass), Snyk (Pass)

Installed on: github-copilot (56), gemini-cli (55), kimi-cli (54), amp (54), codex (54), opencode (54)
