生存验证探针 (PoL Probe) 指南：低成本验证产品假设，避免原型剧场

pol-probe by deanpeters/product-manager-skills

432 周安装量

2,700 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/deanpeters/product-manager-skills --skill pol-probe

方法论测试产品管理

🇨🇳中文介绍

目的

定义并记录 生存验证探针 —— 一种轻量级、一次性的验证工件，旨在昂贵的开发开始之前揭示严酷的现实。当你需要消除特定风险或测试一个狭窄的假设，而无需构建生产级质量的软件时，请使用此方法。PoL 探针是侦察任务，而非 MVP——它们注定要被删除，而不是被扩展。

此框架旨在防止原型剧场（那些取悦利益相关者但毫无教益的昂贵演示），并迫使你将验证方法与实际的学习目标相匹配。

核心概念

什么是 PoL 探针？

生存验证探针 是一种有意的、一次性的验证实验，旨在以尽可能廉价和快速的方式回答一个具体问题。它不是产品，不是 MVP，也不是试点——它是一次有针对性的真相探寻任务。

起源： 由 Dean Peters 提出，基于 Marty Cagan 2014 年关于原型类型的研究以及 Jeff Patton 的原则："测试你想法最昂贵的方式是构建生产级质量的软件。"

五个基本特征

每个 PoL 探针必须满足以下标准：

特征	含义	重要性
轻量级	最小的资源投入（小时/天，而非周）	如果成本高昂，当数据表明应该停止时，你将难以舍弃它
一次性	明确计划删除，而非扩展	防止沉没成本谬误和范围蔓延
范围狭窄	测试一个具体的假设或风险	宽泛的实验会产生模糊的结果

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

维度	PoL 探针	MVP
目的	通过狭窄的假设测试来降低决策风险	证明想法合理或捍卫路线图方向
范围	单一问题，单一风险	最小的可交付产品增量
生命周期	数小时到数天，然后删除	数周到数月，然后迭代
受众	内部团队 + 狭窄的用户样本	生产环境中的真实客户
保真度	仅足以捕捉信号的假象	生产级质量（或接近）
结果	了解什么不可行	了解什么可行（并交付它）

类型	核心问题	时间线	工具/方法	使用时机
1. 可行性检查	"我们能构建这个吗？"	1-2 天	GenAI 提示链、API 测试、数据完整性扫描、临时代码	技术风险未知；第三方依赖关系不明确
2. 任务导向测试	"用户能否无摩擦地完成这项工作？"	2-5 天	Optimal Workshop、UsabilityHub、任务流程	关键节点（字段标签、决策点、流失区域）需要验证
3. 叙事原型	"这个工作流程能获得利益相关者的支持吗？"	1-3 天	Loom 演示、Sora/Synthesia 视频、幻灯片故事板	你需要"讲述而非测试"——分享故事，衡量兴趣
4. 合成数据模拟	"我们能否在不冒生产风险的情况下对此建模？"	2-4 天	Synthea（用户模拟）、DataStax LangFlow（提示逻辑测试）	边缘案例探索；揭示未知的未知
5. 氛围编码 PoL 探针	"这个解决方案能经受住真实用户的接触吗？"	2-3 天	ChatGPT Canvas + Replit + Airtable = "弗兰肯软件"	你需要关于工作流程/UX 的用户反馈，但不需要生产级代码

何时使用 PoL 探针

✅ 在以下情况使用 PoL 探针：

你有一个具体的、可证伪的假设需要测试
特定风险阻碍了你的下一个决策（技术可行性、用户任务完成度、利益相关者支持）
你需要快速获得严酷的真相（几天内，而非几周）
构建生产软件为时过早或会造成浪费
你可以在开始前阐明"失败"是什么样子

❌ 不要在以下情况使用 PoL 探针：

你试图给高管留下深刻印象（那是原型剧场）
你已经知道答案，只是想要确认（那是确认偏误）
你无法阐明清晰的假设或处置计划
学习目标过于宽泛（"客户会喜欢这个吗？"）
你用它来避免做出艰难的决定

使用 template.md 获取完整的填写结构。

使用此结构来记录你的探针：

# PoL 探针：[描述性名称]

## 假设
[一句话陈述你认为是真的内容]
示例："如果我们将注册表单减少到 3 个字段，完成率将超过 80%。"

## 要消除的风险
[你正在解决什么具体的风险或未知因素？]
示例："我们不知道用户是否会因为表单长度而放弃注册。"

## 原型类型
[从 5 种类型中选择一种]
- [ ] 可行性检查
- [ ] 任务导向测试
- [ ] 叙事原型
- [ ] 合成数据模拟
- [x] 氛围编码 PoL 探针

## 目标用户 / 受众
[谁将与这个探针互动？]
示例："来自我们早期访问等候名单的 10 位用户，非技术型中小企业主。"

## 成功标准（严酷真相）
[你在寻找什么真相？什么能证明你错了？]
- **通过：** 8 位以上用户在 2 分钟内完成注册
- **失败：** 少于 6 位用户完成，或平均时间超过 5 分钟
- **学习：** 识别具体的流失字段

## 工具 / 技术栈
[你将使用什么来构建这个？]
示例："使用 ChatGPT Canvas 构建表单 UI，Airtable 进行数据捕获，Loom 进行事后访谈。"

## 时间线
- **构建：** 2 天
- **测试：** 1 天（10 个用户会话）
- **分析：** 1 天
- **处置：** 第 5 天（删除所有代码，保留学习文档）

## 处置计划
[你将在何时以及如何删除这个探针？]
示例："用户会话完成后，归档录音，删除弗兰肯软件代码，将学习内容记录在 Notion 中。"

## 负责人
[谁负责运行和处置这个探针？]

## 状态
- [ ] 假设已定义
- [ ] 探针已构建
- [ ] 用户已招募
- [ ] 测试已完成
- [ ] 学习内容已记录
- [ ] 探针已处置

在启动你的 PoL 探针之前，请验证：

轻量级： 你能在 1-3 天内构建它吗？
一次性： 你是否承诺了一个处置日期？
范围狭窄： 它是否只测试一个假设？
极度诚实： 如果你错了，数据会带来伤害吗？
微小且专注： 它是否比 MVP 更小？
可证伪： 你能描述"失败"是什么样子吗？
明确负责人： 是否有一人负责执行和处置这个探针？

如果任何答案是"否"，请修改你的探针或重新考虑是否需要它。

查看 examples/sample.md 获取完整的 PoL 探针示例。

迷你示例摘录：

**假设：** 用户能区分"归档"和"删除"
**探针类型：** 任务导向测试
**通过：** 80%+ 的正确解读率

运行宽泛的"用户会喜欢这个吗？"实验，而不是测试一个可证伪的假设
将 PoL 探针视为准 MVP 并拒绝处置它
使用避免令人不适真相的虚荣指标
在测试开始前跳过预定义的失败阈值
先选择工具，后确定假设

pol-probe-advisor （交互式）—— 用于选择原型类型的决策框架
discovery-process （工作流程）—— 在验证阶段使用 PoL 探针
problem-statement （组件）—— 在创建 PoL 探针前定义问题
epic-hypothesis （组件）—— 在使用 PoL 探针测试前构建假设

Jeff Patton — 用户故事地图（精益验证原则）
Marty Cagan — Inspired（2014 年原型类型框架）
Dean Peters — 氛围优先，快速验证，确认契合（Dean Peters 的 Substack，2025）

可行性： GenAI（ChatGPT、Claude）、API 测试工具
任务导向： Optimal Workshop、UsabilityHub
叙事： Loom、Sora、Synthesia、Veo3（文本转视频）
合成数据： Synthea（患者模拟）、DataStax LangFlow
氛围编码： ChatGPT Canvas、Replit、Airtable、Carrd

🇺🇸English

Purpose

Define and document a Proof of Life (PoL) probe —a lightweight, disposable validation artifact designed to surface harsh truths before expensive development. Use this when you need to eliminate a specific risk or test a narrow hypothesis without building production-quality software. PoL probes are reconnaissance missions, not MVPs—they're meant to be deleted, not scaled.

This framework prevents prototype theater (expensive demos that impress stakeholders but teach nothing) and forces you to match validation method to actual learning goal.

Key Concepts

What is a PoL Probe?

A Proof of Life (PoL) probe is a deliberate, disposable validation experiment designed to answer one specific question as cheaply and quickly as possible. It's not a product, not an MVP, not a pilot—it's a targeted truth-seeking mission.

Origin: Coined by Dean Peters (Productside), building on Marty Cagan's 2014 work on prototype flavors and Jeff Patton's principle: "The most expensive way to test your idea is to build production-quality software."

The 5 Essential Characteristics

Every PoL probe must satisfy these criteria:

Characteristic	What It Means	Why It Matters
Lightweight	Minimal resource investment (hours/days, not weeks)	If it's expensive, you'll avoid killing it when the data says to
Disposable	Explicitly planned for deletion, not scaling	Prevents sunk-cost fallacy and scope creep
Narrow Scope	Tests one specific hypothesis or risk	Broad experiments yield ambiguous results
Brutally Honest	Surfaces harsh truths, not vanity metrics	Polite data is useless data
Tiny & Focused	Reconnaissance missions, never MVPs	Small surface area = faster learning cycles

Anti-Pattern: If your "prototype" feels too polished to delete, it's not a PoL probe—it's prototype theater.

PoL Probe vs. MVP

Dimension	PoL Probe	MVP
Purpose	De-risk decisions through narrow hypothesis testing	Justify ideas or defend roadmap direction
Scope	Single question, single risk	Smallest shippable product increment
Lifespan	Hours to days, then deleted	Weeks to months, then iterated
Audience	Internal team + narrow user sample	Real customers in production
Fidelity	Just enough illusion to catch signals	Production-quality (or close)
Outcome	Learn what doesn't work	Learn what does work (and ship it)

Key Distinction: PoL probes are pre-MVP reconnaissance. You run probes to decide if you should build an MVP, not to launch something.

The 5 Prototype Flavors

Match the probe type to your hypothesis, not your tooling comfort.

Type	Core Question	Timeline	Tools/Methods	When to Use
1. Feasibility Checks	"Can we build this?"	1-2 days	GenAI prompt chains, API tests, data integrity sweeps, spike-and-delete code	Technical risk is unknown; third-party dependencies unclear
2. Task-Focused Tests	"Can users complete this job without friction?"	2-5 days	Optimal Workshop, UsabilityHub, task flows	Critical moments (field labels, decision points, drop-off zones) need validation
3. Narrative Prototypes	"Does this workflow earn stakeholder buy-in?"	1-3 days	Loom walkthroughs, Sora/Synthesia videos, slideware storyboards	You need to "tell vs. test"—share the story, measure interest
4. Synthetic Data Simulations	"Can we model this without production risk?"	2-4 days	Synthea (user simulation), DataStax LangFlow (prompt logic testing)

Golden Rule: "Use the cheapest prototype that tells the harshest truth. If it doesn't sting, it's probably just theater."

When to Use a PoL Probe

✅ Use a PoL probe when:

You have a specific, falsifiable hypothesis to test
A particular risk blocks your next decision (technical feasibility, user task completion, stakeholder support)
You need harsh truth fast (within days, not weeks)
Building production software would be premature or wasteful
You can articulate what "failure" looks like before you start

❌ Don't use a PoL probe when:

You're trying to impress executives (that's prototype theater)
You already know the answer and just want validation (that's confirmation bias)
You can't articulate a clear hypothesis or disposal plan
The learning goal is too broad ("Will customers like this?")
You're using it to avoid making a hard decision

Application

Use template.md for the full fill-in structure.

PoL Probe Template

Use this structure to document your probe:

# PoL Probe: [Descriptive Name]

## Hypothesis
[One-sentence statement of what you believe to be true]
Example: "If we reduce the onboarding form to 3 fields, completion rate will exceed 80%."

## Risk Being Eliminated
[What specific risk or unknown are you addressing?]
Example: "We don't know if users will abandon signup due to form length."

## Prototype Type
[Select one of the 5 flavors]
- [ ] Feasibility Check
- [ ] Task-Focused Test
- [ ] Narrative Prototype
- [ ] Synthetic Data Simulation
- [x] Vibe-Coded PoL Probe

## Target Users / Audience
[Who will interact with this probe?]
Example: "10 users from our early access waitlist, non-technical SMB owners."

## Success Criteria (Harsh Truth)
[What truth are you seeking? What would prove you wrong?]
- **Pass:** 8+ users complete signup in under 2 minutes
- **Fail:** <6 users complete, or average time exceeds 5 minutes
- **Learn:** Identify specific drop-off fields

## Tools / Stack
[What will you use to build this?]
Example: "ChatGPT Canvas for form UI, Airtable for data capture, Loom for post-session interviews."

## Timeline
- **Build:** 2 days
- **Test:** 1 day (10 user sessions)
- **Analyze:** 1 day
- **Disposal:** Day 5 (delete all code, keep learnings doc)

## Disposal Plan
[When and how will you delete this?]
Example: "After user sessions complete, archive recordings, delete Frankensoft code, document learnings in Notion."

## Owner
[Who is accountable for running and disposing of this probe?]

## Status
- [ ] Hypothesis defined
- [ ] Probe built
- [ ] Users recruited
- [ ] Testing complete
- [ ] Learnings documented
- [ ] Probe disposed

Quality Checklist

Before launching your PoL probe, verify:

Lightweight: Can you build this in 1-3 days?
Disposable: Have you committed to a disposal date?
Narrow Scope: Does it test ONE hypothesis?
Brutally Honest: Will the data hurt if you're wrong?
Tiny & Focused: Is this smaller than an MVP?
Falsifiable: Can you describe what "failure" looks like?
Clear Owner: Is one person accountable for executing and disposing of this?

If any answer is "no," revise your probe or reconsider whether you need one.

Examples

See examples/sample.md for full PoL probe examples.

Mini example excerpt:

**Hypothesis:** Users can distinguish "archive" vs "delete"
**Probe Type:** Task-Focused Test
**Pass:** 80%+ correct interpretation

Common Pitfalls

Running a broad "will users like this?" experiment instead of testing one falsifiable hypothesis
Treating a PoL probe as a proto-MVP and refusing to dispose of it
Using vanity metrics that avoid uncomfortable truth
Skipping a pre-defined failure threshold before testing begins
Choosing tools first and hypothesis second

References

Related Skills

pol-probe-advisor (Interactive) — Decision framework for choosing which prototype type to use
discovery-process (Workflow) — Use PoL probes in validation phase
problem-statement (Component) — Define problem before creating PoL probe
epic-hypothesis (Component) — Frame hypothesis before testing with PoL probe

External Frameworks

Jeff Patton — User Story Mapping (lean validation principles)
Marty Cagan — Inspired (2014 prototype flavors framework)
Dean Peters — Vibe First, Validate Fast, Verify Fit (Dean Peters' Substack, 2025)

Tools Mentioned

Feasibility: GenAI (ChatGPT, Claude), API testing tools
Task-Focused: Optimal Workshop, UsabilityHub
Narrative: Loom, Sora, Synthesia, Veo3 (text-to-video)
Synthetic Data: Synthea (patient simulation), DataStax LangFlow
Vibe-Coded: ChatGPT Canvas, Replit, Airtable, Carrd

Weekly Installs

213

Repository

deanpeters/prod…r-skills

GitHub Stars

1.5K

First Seen

Feb 12, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykPass

Installed on

codex188

opencode188

gemini-cli185

github-copilot184

cursor182

kimi-cli181

代码审查最佳实践指南：完整流程、安全与性能审查清单

12,400 周安装

生存验证探针 (PoL Probe) 指南：低成本验证产品假设，避免原型剧场

🇨🇳中文介绍

目的

核心概念

什么是 PoL 探针？

五个基本特征

相关 Skills

PoL 探针 vs. MVP

五种原型类型

何时使用 PoL 探针

应用

PoL 探针模板

质量检查清单

示例

常见陷阱

参考资料

相关技能

外部框架

提及的工具