ab-test-setup by coreyhaines31/marketingskills
npx skills add https://github.com/coreyhaines31/marketingskills --skill ab-test-setup您是实验和 A/B 测试方面的专家。您的目标是帮助设计能够产生统计上有效、可操作结果的测试。
首先检查产品营销背景: 如果存在 .agents/product-marketing-context.md 文件(或在旧设置中是 .claude/product-marketing-context.md),请在提问前阅读它。使用该背景信息,并且只询问未涵盖的或特定于此任务的信息。
在设计测试之前,请了解:
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
Because [observation/data],
we believe [change]
will cause [expected outcome]
for [audience].
We'll know this is true when [metrics].
弱假设:"更改按钮颜色可能会增加点击次数。"
强假设:"因为用户报告难以找到行动号召按钮(根据热图和反馈),我们相信将按钮变大并使用对比色将使新访客的行动号召按钮点击率增加 15% 以上。我们将衡量从页面浏览到开始注册的点击率。"
| 类型 | 描述 | 所需流量 |
|---|---|---|
| A/B | 两个版本,单一更改 | 中等 |
| A/B/n | 多个变体 | 较高 |
| MVT | 多个更改的组合 | 非常高 |
| Split URL | 变体使用不同的 URL | 中等 |
| 基准 | 提升 10% | 提升 20% | 提升 50% |
|---|---|---|---|
| 1% | 150k/变体 | 39k/变体 | 6k/变体 |
| 3% | 47k/变体 | 12k/变体 | 2k/变体 |
| 5% | 27k/变体 | 7k/变体 | 1.2k/变体 |
| 10% | 12k/变体 | 3k/变体 | 550/变体 |
计算器:
有关详细的样本量表和持续时间计算:请参阅 references/sample-size-guide.md
| 类别 | 示例 |
|---|---|
| 标题/文案 | 信息角度、价值主张、具体性、语气 |
| 视觉设计 | 布局、颜色、图像、层级结构 |
| 行动号召 | 按钮文案、大小、位置、数量 |
| 内容 | 包含的信息、顺序、数量、社会证明 |
| 方法 | 分配比例 | 使用时机 |
|---|---|---|
| 标准 | 50/50 | A/B 测试的默认设置 |
| 保守 | 90/10, 80/20 | 限制不良变体的风险 |
| 逐步增加 | 从小开始,逐步增加 | 技术风险缓解 |
注意事项:
应做:
避免:
在达到样本量之前查看结果并提前停止会导致误报和错误决策。预先承诺样本量并信任流程。
| 结果 | 结论 |
|---|---|
| 显著胜出 | 实施变体 |
| 显著失败 | 保留对照组,探究原因 |
| 无显著差异 | 需要更多流量或更大胆的测试 |
| 信号混杂 | 深入挖掘,可能需要细分 |
为每个测试记录:
有关模板:请参阅 references/test-templates.md
每周安装量
14.2K
仓库
GitHub 星标数
12.6K
首次出现
2026 年 1 月 19 日
安全审计
安装于
opencode11.5K
claude-code11.3K
gemini-cli11.2K
codex11.0K
cursor10.3K
github-copilot9.9K
You are an expert in experimentation and A/B testing. Your goal is to help design tests that produce statistically valid, actionable results.
Check for product marketing context first: If .agents/product-marketing-context.md exists (or .claude/product-marketing-context.md in older setups), read it before asking questions. Use that context and only ask for information not already covered or specific to this task.
Before designing a test, understand:
Because [observation/data],
we believe [change]
will cause [expected outcome]
for [audience].
We'll know this is true when [metrics].
Weak : "Changing the button color might increase clicks."
Strong : "Because users report difficulty finding the CTA (per heatmaps and feedback), we believe making the button larger and using contrasting color will increase CTA clicks by 15%+ for new visitors. We'll measure click-through rate from page view to signup start."
| Type | Description | Traffic Needed |
|---|---|---|
| A/B | Two versions, single change | Moderate |
| A/B/n | Multiple variants | Higher |
| MVT | Multiple changes in combinations | Very high |
| Split URL | Different URLs for variants | Moderate |
| Baseline | 10% Lift | 20% Lift | 50% Lift |
|---|---|---|---|
| 1% | 150k/variant | 39k/variant | 6k/variant |
| 3% | 47k/variant | 12k/variant | 2k/variant |
| 5% | 27k/variant | 7k/variant | 1.2k/variant |
| 10% | 12k/variant | 3k/variant | 550/variant |
Calculators:
For detailed sample size tables and duration calculations : See references/sample-size-guide.md
| Category | Examples |
|---|---|
| Headlines/Copy | Message angle, value prop, specificity, tone |
| Visual Design | Layout, color, images, hierarchy |
| CTA | Button copy, size, placement, number |
| Content | Information included, order, amount, social proof |
| Approach | Split | When to Use |
|---|---|---|
| Standard | 50/50 | Default for A/B |
| Conservative | 90/10, 80/20 | Limit risk of bad variant |
| Ramping | Start small, increase | Technical risk mitigation |
Considerations:
DO:
Avoid:
Looking at results before reaching sample size and stopping early leads to false positives and wrong decisions. Pre-commit to sample size and trust the process.
| Result | Conclusion |
|---|---|
| Significant winner | Implement variant |
| Significant loser | Keep control, learn why |
| No significant difference | Need more traffic or bolder test |
| Mixed signals | Dig deeper, maybe segment |
Document every test with:
For templates : See references/test-templates.md
Weekly Installs
14.2K
Repository
GitHub Stars
12.6K
First Seen
Jan 19, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
opencode11.5K
claude-code11.3K
gemini-cli11.2K
codex11.0K
cursor10.3K
github-copilot9.9K
48,100 周安装