pol-probe by deanpeters/product-manager-skills
npx skills add https://github.com/deanpeters/product-manager-skills --skill pol-probe定义并记录 生存验证探针 —— 一种轻量级、一次性的验证工件,旨在昂贵的开发开始之前揭示严酷的现实。当你需要消除特定风险或测试一个狭窄的假设,而无需构建生产级质量的软件时,请使用此方法。PoL 探针是侦察任务,而非 MVP——它们注定要被删除,而不是被扩展。
此框架旨在防止原型剧场(那些取悦利益相关者但毫无教益的昂贵演示),并迫使你将验证方法与实际的学习目标相匹配。
生存验证探针 是一种有意的、一次性的验证实验,旨在以尽可能廉价和快速的方式回答一个具体问题。它不是产品,不是 MVP,也不是试点——它是一次有针对性的真相探寻任务。
起源: 由 Dean Peters 提出,基于 Marty Cagan 2014 年关于原型类型的研究以及 Jeff Patton 的原则:"测试你想法最昂贵的方式是构建生产级质量的软件。"
每个 PoL 探针必须满足以下标准:
| 特征 | 含义 | 重要性 |
|---|---|---|
| 轻量级 | 最小的资源投入(小时/天,而非周) | 如果成本高昂,当数据表明应该停止时,你将难以舍弃它 |
| 一次性 | 明确计划删除,而非扩展 | 防止沉没成本谬误和范围蔓延 |
| 范围狭窄 | 测试一个具体的假设或风险 | 宽泛的实验会产生模糊的结果 |
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
| 揭示严酷的现实,而非虚荣指标 |
| 温和的数据是无用的数据 |
| 微小且专注 | 侦察任务,绝非 MVP | 小表面积 = 更快的学习周期 |
反面模式: 如果你的"原型"感觉过于精美而不忍删除,那它就不是 PoL 探针——而是原型剧场。
| 维度 | PoL 探针 | MVP |
|---|---|---|
| 目的 | 通过狭窄的假设测试来降低决策风险 | 证明想法合理或捍卫路线图方向 |
| 范围 | 单一问题,单一风险 | 最小的可交付产品增量 |
| 生命周期 | 数小时到数天,然后删除 | 数周到数月,然后迭代 |
| 受众 | 内部团队 + 狭窄的用户样本 | 生产环境中的真实客户 |
| 保真度 | 仅足以捕捉信号的假象 | 生产级质量(或接近) |
| 结果 | 了解什么不可行 | 了解什么可行(并交付它) |
关键区别: PoL 探针是 MVP 前的侦察。你运行探针是为了决定是否应该构建 MVP,而不是为了发布某个东西。
根据你的假设选择探针类型,而不是根据你熟悉的工具。
| 类型 | 核心问题 | 时间线 | 工具/方法 | 使用时机 |
|---|---|---|---|---|
| 1. 可行性检查 | "我们能构建这个吗?" | 1-2 天 | GenAI 提示链、API 测试、数据完整性扫描、临时代码 | 技术风险未知;第三方依赖关系不明确 |
| 2. 任务导向测试 | "用户能否无摩擦地完成这项工作?" | 2-5 天 | Optimal Workshop、UsabilityHub、任务流程 | 关键节点(字段标签、决策点、流失区域)需要验证 |
| 3. 叙事原型 | "这个工作流程能获得利益相关者的支持吗?" | 1-3 天 | Loom 演示、Sora/Synthesia 视频、幻灯片故事板 | 你需要"讲述而非测试"——分享故事,衡量兴趣 |
| 4. 合成数据模拟 | "我们能否在不冒生产风险的情况下对此建模?" | 2-4 天 | Synthea(用户模拟)、DataStax LangFlow(提示逻辑测试) | 边缘案例探索;揭示未知的未知 |
| 5. 氛围编码 PoL 探针 | "这个解决方案能经受住真实用户的接触吗?" | 2-3 天 | ChatGPT Canvas + Replit + Airtable = "弗兰肯软件" | 你需要关于工作流程/UX 的用户反馈,但不需要生产级代码 |
黄金法则: "使用能揭示最严酷真相的最廉价原型。如果它不刺痛你,那很可能只是剧场。"
✅ 在以下情况使用 PoL 探针:
❌ 不要在以下情况使用 PoL 探针:
使用 template.md 获取完整的填写结构。
使用此结构来记录你的探针:
# PoL 探针:[描述性名称]
## 假设
[一句话陈述你认为是真的内容]
示例:"如果我们将注册表单减少到 3 个字段,完成率将超过 80%。"
## 要消除的风险
[你正在解决什么具体的风险或未知因素?]
示例:"我们不知道用户是否会因为表单长度而放弃注册。"
## 原型类型
[从 5 种类型中选择一种]
- [ ] 可行性检查
- [ ] 任务导向测试
- [ ] 叙事原型
- [ ] 合成数据模拟
- [x] 氛围编码 PoL 探针
## 目标用户 / 受众
[谁将与这个探针互动?]
示例:"来自我们早期访问等候名单的 10 位用户,非技术型中小企业主。"
## 成功标准(严酷真相)
[你在寻找什么真相?什么能证明你错了?]
- **通过:** 8 位以上用户在 2 分钟内完成注册
- **失败:** 少于 6 位用户完成,或平均时间超过 5 分钟
- **学习:** 识别具体的流失字段
## 工具 / 技术栈
[你将使用什么来构建这个?]
示例:"使用 ChatGPT Canvas 构建表单 UI,Airtable 进行数据捕获,Loom 进行事后访谈。"
## 时间线
- **构建:** 2 天
- **测试:** 1 天(10 个用户会话)
- **分析:** 1 天
- **处置:** 第 5 天(删除所有代码,保留学习文档)
## 处置计划
[你将在何时以及如何删除这个探针?]
示例:"用户会话完成后,归档录音,删除弗兰肯软件代码,将学习内容记录在 Notion 中。"
## 负责人
[谁负责运行和处置这个探针?]
## 状态
- [ ] 假设已定义
- [ ] 探针已构建
- [ ] 用户已招募
- [ ] 测试已完成
- [ ] 学习内容已记录
- [ ] 探针已处置
在启动你的 PoL 探针之前,请验证:
如果任何答案是"否",请修改你的探针或重新考虑是否需要它。
查看 examples/sample.md 获取完整的 PoL 探针示例。
迷你示例摘录:
**假设:** 用户能区分"归档"和"删除"
**探针类型:** 任务导向测试
**通过:** 80%+ 的正确解读率
每周安装量
213
代码库
GitHub 星标数
1.5K
首次出现
2026年2月12日
安全审计
安装于
codex188
opencode188
gemini-cli185
github-copilot184
cursor182
kimi-cli181
Define and document a Proof of Life (PoL) probe —a lightweight, disposable validation artifact designed to surface harsh truths before expensive development. Use this when you need to eliminate a specific risk or test a narrow hypothesis without building production-quality software. PoL probes are reconnaissance missions, not MVPs—they're meant to be deleted, not scaled.
This framework prevents prototype theater (expensive demos that impress stakeholders but teach nothing) and forces you to match validation method to actual learning goal.
A Proof of Life (PoL) probe is a deliberate, disposable validation experiment designed to answer one specific question as cheaply and quickly as possible. It's not a product, not an MVP, not a pilot—it's a targeted truth-seeking mission.
Origin: Coined by Dean Peters (Productside), building on Marty Cagan's 2014 work on prototype flavors and Jeff Patton's principle: "The most expensive way to test your idea is to build production-quality software."
Every PoL probe must satisfy these criteria:
| Characteristic | What It Means | Why It Matters |
|---|---|---|
| Lightweight | Minimal resource investment (hours/days, not weeks) | If it's expensive, you'll avoid killing it when the data says to |
| Disposable | Explicitly planned for deletion, not scaling | Prevents sunk-cost fallacy and scope creep |
| Narrow Scope | Tests one specific hypothesis or risk | Broad experiments yield ambiguous results |
| Brutally Honest | Surfaces harsh truths, not vanity metrics | Polite data is useless data |
| Tiny & Focused | Reconnaissance missions, never MVPs | Small surface area = faster learning cycles |
Anti-Pattern: If your "prototype" feels too polished to delete, it's not a PoL probe—it's prototype theater.
| Dimension | PoL Probe | MVP |
|---|---|---|
| Purpose | De-risk decisions through narrow hypothesis testing | Justify ideas or defend roadmap direction |
| Scope | Single question, single risk | Smallest shippable product increment |
| Lifespan | Hours to days, then deleted | Weeks to months, then iterated |
| Audience | Internal team + narrow user sample | Real customers in production |
| Fidelity | Just enough illusion to catch signals | Production-quality (or close) |
| Outcome | Learn what doesn't work | Learn what does work (and ship it) |
Key Distinction: PoL probes are pre-MVP reconnaissance. You run probes to decide if you should build an MVP, not to launch something.
Match the probe type to your hypothesis, not your tooling comfort.
| Type | Core Question | Timeline | Tools/Methods | When to Use |
|---|---|---|---|---|
| 1. Feasibility Checks | "Can we build this?" | 1-2 days | GenAI prompt chains, API tests, data integrity sweeps, spike-and-delete code | Technical risk is unknown; third-party dependencies unclear |
| 2. Task-Focused Tests | "Can users complete this job without friction?" | 2-5 days | Optimal Workshop, UsabilityHub, task flows | Critical moments (field labels, decision points, drop-off zones) need validation |
| 3. Narrative Prototypes | "Does this workflow earn stakeholder buy-in?" | 1-3 days | Loom walkthroughs, Sora/Synthesia videos, slideware storyboards | You need to "tell vs. test"—share the story, measure interest |
| 4. Synthetic Data Simulations | "Can we model this without production risk?" | 2-4 days | Synthea (user simulation), DataStax LangFlow (prompt logic testing) |
Golden Rule: "Use the cheapest prototype that tells the harshest truth. If it doesn't sting, it's probably just theater."
✅ Use a PoL probe when:
❌ Don't use a PoL probe when:
Use template.md for the full fill-in structure.
Use this structure to document your probe:
# PoL Probe: [Descriptive Name]
## Hypothesis
[One-sentence statement of what you believe to be true]
Example: "If we reduce the onboarding form to 3 fields, completion rate will exceed 80%."
## Risk Being Eliminated
[What specific risk or unknown are you addressing?]
Example: "We don't know if users will abandon signup due to form length."
## Prototype Type
[Select one of the 5 flavors]
- [ ] Feasibility Check
- [ ] Task-Focused Test
- [ ] Narrative Prototype
- [ ] Synthetic Data Simulation
- [x] Vibe-Coded PoL Probe
## Target Users / Audience
[Who will interact with this probe?]
Example: "10 users from our early access waitlist, non-technical SMB owners."
## Success Criteria (Harsh Truth)
[What truth are you seeking? What would prove you wrong?]
- **Pass:** 8+ users complete signup in under 2 minutes
- **Fail:** <6 users complete, or average time exceeds 5 minutes
- **Learn:** Identify specific drop-off fields
## Tools / Stack
[What will you use to build this?]
Example: "ChatGPT Canvas for form UI, Airtable for data capture, Loom for post-session interviews."
## Timeline
- **Build:** 2 days
- **Test:** 1 day (10 user sessions)
- **Analyze:** 1 day
- **Disposal:** Day 5 (delete all code, keep learnings doc)
## Disposal Plan
[When and how will you delete this?]
Example: "After user sessions complete, archive recordings, delete Frankensoft code, document learnings in Notion."
## Owner
[Who is accountable for running and disposing of this probe?]
## Status
- [ ] Hypothesis defined
- [ ] Probe built
- [ ] Users recruited
- [ ] Testing complete
- [ ] Learnings documented
- [ ] Probe disposed
Before launching your PoL probe, verify:
If any answer is "no," revise your probe or reconsider whether you need one.
See examples/sample.md for full PoL probe examples.
Mini example excerpt:
**Hypothesis:** Users can distinguish "archive" vs "delete"
**Probe Type:** Task-Focused Test
**Pass:** 80%+ correct interpretation
Weekly Installs
213
Repository
GitHub Stars
1.5K
First Seen
Feb 12, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
codex188
opencode188
gemini-cli185
github-copilot184
cursor182
kimi-cli181
代码审查最佳实践指南:完整流程、安全与性能审查清单
12,400 周安装
竞争对手研究指南:SEO、内容、反向链接与定价分析工具
231 周安装
Azure 工作负载自动升级评估工具 - 支持 Functions、App Service 计划与 SKU 迁移
231 周安装
Kaizen持续改进方法论:软件开发中的渐进式优化与防错设计实践指南
231 周安装
软件UI/UX设计指南:以用户为中心的设计原则、WCAG可访问性与平台规范
231 周安装
Apify 网络爬虫和自动化平台 - 无需编码抓取亚马逊、谷歌、领英等网站数据
231 周安装
llama.cpp 中文指南:纯 C/C++ LLM 推理,CPU/非 NVIDIA 硬件优化部署
231 周安装
| Edge case exploration; unknown-unknown surfacing |
| 5. Vibe-Coded PoL Probes | "Will this solution survive real user contact?" | 2-3 days | ChatGPT Canvas + Replit + Airtable = "Frankensoft" | You need user feedback on workflow/UX, but not production-grade code |