ab-test-setup by sickn33/antigravity-awesome-skills
npx skills add https://github.com/sickn33/antigravity-awesome-skills --skill ab-test-setup在编写任何代码之前,确保每个 A/B 测试都是有效、严谨且安全的。
你必须具备:
一个有效的假设应包括:
在设计变体或指标之前,你必须:
明确询问:
“这是我们承诺用于本次测试的最终假设吗?”
在得到确认之前,请勿继续。
明确列出关于以下方面的假设:
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
如果假设薄弱或被违反:
选择最简单的有效测试类型:
除非有明确理由,否则默认选择 A/B 测试。
预先定义:
估算:
没有现实的样本量估算,请勿继续。
仅当以下所有条件都满足时,你才可以进入实施阶段:
如果缺少任何一项,请停止并解决它。
应该做:
不应该做:
解释结果时:
| 结果 | 行动 |
|---|---|
| 显著正向 | 考虑推广 |
| 显著负向 | 拒绝变体,记录学习成果 |
| 不确定 | 考虑增加流量或进行更大胆的改动 |
| 护栏失败 | 即使主要指标成功,也不要发布 |
记录:
将记录存储在共享、可搜索的位置,以避免重复失败。
在以下情况下拒绝继续:
解释原因并建议后续步骤。
A/B 测试不是为了证明想法正确。它是关于自信地学习真相。
如果你感到想要匆忙、简化或“只是试试看”——这就是放慢速度并重新检查设计的信号。
此技能适用于执行概述中描述的工作流程或操作。
每周安装数
294
代码仓库
GitHub 星标数
27.4K
首次出现
Jan 19, 2026
安全审计
安装于
claude-code236
opencode235
gemini-cli234
antigravity213
cursor207
codex206
Ensure every A/B test is valid, rigorous, and safe before a single line of code is written.
You must have:
A valid hypothesis includes:
Before designing variants or metrics, you MUST:
Ask explicitly:
“Is this the final hypothesis we are committing to for this test?”
Do NOT proceed until confirmed.
Explicitly list assumptions about:
If assumptions are weak or violated:
Choose the simplest valid test:
Default to A/B unless there is a clear reason otherwise.
Define upfront:
Estimate:
Do NOT proceed without a realistic sample size estimate.
You may proceed to implementation only if all are true :
If any item is missing, stop and resolve it.
DO:
DO NOT:
When interpreting results:
| Result | Action |
|---|---|
| Significant positive | Consider rollout |
| Significant negative | Reject variant, document learning |
| Inconclusive | Consider more traffic or bolder change |
| Guardrail failure | Do not ship, even if primary wins |
Document:
Store records in a shared, searchable location to avoid repeated failures.
Refuse to proceed if:
Explain why and recommend next steps.
A/B testing is not about proving ideas right. It is about learning the truth with confidence.
If you feel tempted to rush, simplify, or “just try it” — that is the signal to slow down and re-check the design.
This skill is applicable to execute the workflow or actions described in the overview.
Weekly Installs
294
Repository
GitHub Stars
27.4K
First Seen
Jan 19, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
claude-code236
opencode235
gemini-cli234
antigravity213
cursor207
codex206
Excel财务建模规范与xlsx文件处理指南:专业格式、零错误公式与数据分析
39,600 周安装