重要前提
安装AI Skills的关键前提是:必须科学上网,且开启TUN模式,这一点至关重要,直接决定安装能否顺利完成,在此郑重提醒三遍:科学上网,科学上网,科学上网。查看完整安装教程 →
recursive-improvement by dylanfeltus/skills
npx skills add https://github.com/dylanfeltus/skills --skill recursive-improvement一种通过依据明确评分标准进行迭代来生成更高质量输出的模式。
generate → evaluate → diagnose → improve → repeat (until passing)
切勿将初稿输出用于重要内容。 运行此循环。
像往常一样创建初始输出。
根据每条标准(1-10分)对输出进行评分。务必绝对诚实。
对于任何低于阈值的标准:
针对每个诊断出的弱点进行重写。不要打补丁——重建薄弱部分。
重新评估。持续循环,直到所有标准都达到阈值(通常最低8/10分)。
在通过标准后,从敌对角度攻击输出:
如果它能经受住考验,就发布它。如果不能,就继续迭代。
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
| 标准 | 评估内容 |
|---|---|
| 吸引力强度 | 第一行能抓住注意力吗?有模式中断吗? |
| 好奇心缺口 | 能产生继续阅读的冲动吗? |
| 清晰度 | 一个清晰的观点?没有混淆? |
| 声音匹配度 | 听起来像目标声音/品牌吗? |
| 互动潜力 | 人们会回复/分享/收藏吗? |
| 拇指停留力 | 浏览者会停下来吗? |
| 价值密度 | 每一行都有其存在的价值吗? |
| 行动号召清晰度 | 读者接下来该做什么很清楚吗? |
对抗性测试: 一个在晚上11点注意力分散、持怀疑态度的用户会与这个互动吗?
| 标准 | 评估内容 |
|---|---|
| 标题清晰度 | 能立刻明白这个业务是做什么的吗? |
| 价值主张强度 | 为什么选择他们而不是竞争对手? |
| 利益焦点 | 功能是否转化为客户利益? |
| 行动号召有效性 | 行动是否清晰、有说服力?摩擦低吗? |
| 信任信号 | 建立了可信度吗?有社会证明吗? |
| 可读性 | 易于浏览吗?段落简短吗?层次清晰吗? |
| 异议处理 | 解决了常见的担忧吗? |
| 具体性 | 是具体的细节还是模糊的声明? |
对抗性测试: 一个用手机搜索的人会在30秒内采取行动吗?
| 标准 | 评估内容 |
|---|---|
| 主题行 | 这个会被打开吗?在收件箱中突出吗? |
| 开场吸引力 | 第一句话能赢得第二句话的机会吗? |
| 单一焦点 | 每封邮件只有一个明确的请求吗? |
| 可快速浏览性 | 能在5秒内了解要点吗? |
| 行动号召突出性 | 行动明显且容易吗? |
| 声音一致性 | 与品牌/发件人个性匹配吗? |
| 长度合适 | 没有废话,没有遗漏吗? |
| 移动端友好 | 在小屏幕上能正常显示吗? |
对抗性测试: 一个有200封未读邮件的忙碌人士会对此采取行动吗?
| 标准 | 评估内容 |
|---|---|
| 拇指停留力 | 前2秒内有模式中断吗? |
| 好奇心缺口 | 产生了需要了解更多信息的欲望吗? |
| 情感触发点 | 击中了真正的痛点或欲望吗? |
| 可信度 | 可信吗?不是好得令人难以置信吗? |
| 行动号召强度 | 下一步清晰且摩擦低吗? |
| 用户画像匹配度 | 直接针对目标受众说话吗? |
| 差异化 | 与竞争对手的广告相比突出吗? |
| 平台原生性 | 符合平台的风格/格式吗? |
对抗性测试: 这会让你停止滚动吗?你会点击吗?
始终用于:
可以跳过用于:
## 输出 v1
[初始生成内容]
## 评估 v1
- 吸引力强度: 6/10 — 开场薄弱,无模式中断
- 清晰度: 8/10 — 足够清晰
- 声音匹配度: 7/10 — 过于正式
[... 对所有标准评分]
## 诊断
1. 吸引点需要一个令人惊讶的数据或相反的观点
2. 声音应该更随意,句子更短
3. [...]
## 输出 v2
[针对弱点修订后的版本]
## 评估 v2
[重新评分——持续直到全部通过]
循环通常会增加2-3次迭代。对于任何重要的事情都是值得的。
每周安装数
68
代码仓库
GitHub 星标数
174
首次出现
2026年2月19日
安全审计
安装于
opencode59
codex57
gemini-cli57
github-copilot51
claude-code51
cursor50
A pattern for generating higher-quality output by iterating against explicit scoring criteria.
generate → evaluate → diagnose → improve → repeat (until passing)
Never ship first-draft output for important content. Run the loop.
Create the initial output as you normally would.
Score the output against each criterion (1-10). Be brutally honest.
For any criterion scoring below threshold:
Rewrite addressing each diagnosed weakness. Don't patch — rebuild the weak sections.
Re-evaluate. Keep looping until all criteria pass threshold (usually 8/10 minimum).
After passing criteria, attack the output from a hostile perspective:
If it survives, ship it. If not, iterate.
| Criterion | What to evaluate |
|---|---|
| Hook strength | First line grabs attention? Pattern interrupt? |
| Curiosity gap | Creates urge to keep reading? |
| Clarity | One clear idea? No confusion? |
| Voice match | Sounds like the target voice/brand? |
| Engagement potential | People will reply/share/save? |
| Thumb-stop power | Scroller would pause? |
| Value density | Every line earns its place? |
| CTA clarity | Clear what reader should do next? |
Adversarial test: Would a distracted, skeptical user at 11pm engage with this?
| Criterion | What to evaluate |
|---|---|
| Headline clarity | Instantly clear what this business does? |
| Value prop strength | Why choose them over competitors? |
| Benefit focus | Features translated to customer benefits? |
| CTA effectiveness | Clear, compelling action? Low friction? |
| Trust signals | Credibility established? Social proof? |
| Readability | Scannable? Short paragraphs? Clear hierarchy? |
| Objection handling | Common concerns addressed? |
| Specificity | Concrete details vs vague claims? |
Adversarial test: Would someone searching on their phone take action within 30 seconds?
| Criterion | What to evaluate |
|---|---|
| Subject line | Would this get opened? Stands out in inbox? |
| Opening hook | First sentence earns the second? |
| Single focus | One clear ask per email? |
| Skimmability | Can get the gist in 5 seconds? |
| CTA prominence | Action is obvious and easy? |
| Voice consistency | Matches brand/sender personality? |
| Length appropriate | No fluff, nothing missing? |
| Mobile friendly | Works on small screens? |
Adversarial test: Would a busy person with 200 unread emails act on this?
| Criterion | What to evaluate |
|---|---|
| Thumb-stop power | Pattern interrupt in first 2 seconds? |
| Curiosity gap | Creates need to know more? |
| Emotional trigger | Hits a real pain point or desire? |
| Credibility | Believable? Not too good to be true? |
| CTA strength | Clear next step with low friction? |
| Persona match | Speaks directly to target audience? |
| Differentiation | Stands out from competitor ads? |
| Platform native | Fits the platform's style/format? |
Adversarial test: Would this stop YOUR scroll? Would you click?
Always use for:
Can skip for:
## Output v1
[Initial generation]
## Evaluation v1
- Hook strength: 6/10 — Opens weak, no pattern interrupt
- Clarity: 8/10 — Clear enough
- Voice match: 7/10 — Too formal
[... score all criteria]
## Diagnosis
1. Hook needs a surprising stat or contrarian take
2. Voice should be more casual, shorter sentences
3. [...]
## Output v2
[Revised version addressing weaknesses]
## Evaluation v2
[Re-score — continue until all pass]
The loop typically adds 2-3 iterations. Worth it for anything that matters.
Weekly Installs
68
Repository
GitHub Stars
174
First Seen
Feb 19, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
opencode59
codex57
gemini-cli57
github-copilot51
claude-code51
cursor50
营销心理学与心智模型应用指南 | 提升营销决策与客户行为理解
45,100 周安装