Dogfood - Vercel Labs Automated Web App Exploration and Issue Reporting Tool

dogfood by vercel-labs/agent-browser

14,300 weekly installs

25,100 GitHub Stars

Install command

npx skills add https://github.com/vercel-labs/agent-browser --skill dogfood

Dogfood

Systematically explore a web application, find issues, and produce a report with full reproduction evidence for every finding.

Setup

Only the Target URL is required. Everything else has sensible defaults -- use them unless the user explicitly provides an override.

Parameter	Default	Example override
Target URL	(required)	`vercel.com`, `http://localhost:3000`
Session name	Slugified domain (e.g., `vercel.com` -> `vercel-com`)	`--session my-session`
Output directory	`./dogfood-output/`	`Output directory: /tmp/qa`
Scope	Full app	`Focus on the billing page`
Authentication	None	`Sign in to user@example.com`

If the user says something like "dogfood vercel.com", start immediately with defaults. Do not ask clarifying questions unless authentication is mentioned but credentials are missing.
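The default session name can be derived mechanically from the target URL. The sketch below is one possible way to do it, assuming the slug is the lowercased host with each run of non-alphanumeric characters collapsed to a hyphen, and that scheme, port, and path are dropped. `slugify_domain` is a hypothetical helper, not part of agent-browser.

```shell
# slugify_domain URL
# Hypothetical helper: turns a target URL into a default session name.
# Assumption: only the host is slugified; scheme, port, and path are ignored.
slugify_domain() {
  local host="${1#*://}"   # drop the scheme if one is present
  host="${host%%/*}"       # drop any path
  host="${host%%:*}"       # drop any port
  printf '%s\n' "$host" \
    | tr '[:upper:]' '[:lower:]' \
    | sed -E 's/[^a-z0-9]+/-/g; s/^-+//; s/-+$//'
}

# Example: SESSION=$(slugify_domain vercel.com)   # vercel.com -> vercel-com
```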

Always use agent-browser directly -- never npx agent-browser. The direct binary uses the fast Rust client. npx routes through Node.js and is significantly slower.

Workflow

1. Initialize    Set up session, output dirs, report file
2. Authenticate  Sign in if needed, save state
3. Orient        Navigate to starting point, take initial snapshot
4. Explore       Systematically visit pages and test features
5. Document      Screenshot + record each issue as found
6. Wrap up       Update summary counts, close session

1. Initialize

mkdir -p {OUTPUT_DIR}/screenshots {OUTPUT_DIR}/videos

Copy the report template into the output directory and fill in the header fields:

cp {SKILL_DIR}/templates/dogfood-report-template.md {OUTPUT_DIR}/report.md

Start a named session:

agent-browser --session {SESSION} open {TARGET_URL}
agent-browser --session {SESSION} wait --load networkidle
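The directory and report setup above can be wrapped in a small function. This is a minimal sketch: `init_output` is a hypothetical name, and leaving an existing report.md untouched is an assumption made here so that findings from an earlier run are not overwritten.

```shell
# init_output OUTPUT_DIR TEMPLATE
# Creates the evidence directories and seeds report.md from the template.
# Assumption: an existing report.md is kept as-is rather than replaced.
init_output() {
  local out="$1" template="$2"
  mkdir -p "$out/screenshots" "$out/videos" || return 1
  if [ ! -e "$out/report.md" ]; then
    cp "$template" "$out/report.md" || return 1
  fi
}

# Example: init_output ./dogfood-output "$SKILL_DIR/templates/dogfood-report-template.md"
```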

2. Authenticate

If the app requires login:

agent-browser --session {SESSION} snapshot -i
# Identify login form refs, fill credentials
agent-browser --session {SESSION} fill @e1 "{EMAIL}"
agent-browser --session {SESSION} fill @e2 "{PASSWORD}"
agent-browser --session {SESSION} click @e3
agent-browser --session {SESSION} wait --load networkidle

For OTP/email codes: ask the user, wait for their response, then enter the code.

After successful login, save state for potential reuse:

agent-browser --session {SESSION} state save {OUTPUT_DIR}/auth-state.json
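Because element refs such as @e1 differ from page to page, the login sequence is easier to reuse when the refs are passed in. The following sketch chains the same commands shown above and stops at the first failure. `login` and the `AGENT_BROWSER` override are hypothetical names introduced for illustration; the override only exists so the sequence can be dry-run with `echo`.

```shell
# login SESSION EMAIL_REF PASSWORD_REF SUBMIT_REF EMAIL PASSWORD STATE_FILE
# Fills the login form, submits it, waits for the page to settle, then saves state.
login() {
  local ab="${AGENT_BROWSER:-agent-browser}"   # hypothetical override for dry runs
  local session="$1"
  "$ab" --session "$session" fill "$2" "$5" &&
  "$ab" --session "$session" fill "$3" "$6" &&
  "$ab" --session "$session" click "$4" &&
  "$ab" --session "$session" wait --load networkidle &&
  "$ab" --session "$session" state save "$7"
}

# Example (refs taken from a prior `snapshot -i`):
# login "$SESSION" @e1 @e2 @e3 "$EMAIL" "$PASSWORD" "$OUTPUT_DIR/auth-state.json"
```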

3. Orient

Take an initial annotated screenshot and snapshot to understand the app structure:

agent-browser --session {SESSION} screenshot --annotate {OUTPUT_DIR}/screenshots/initial.png
agent-browser --session {SESSION} snapshot -i

Identify the main navigation elements and map out the sections to visit.

4. Explore

Read references/issue-taxonomy.md for the full list of what to look for and the exploration checklist.

Strategy -- work through the app systematically:

1. Start from the main navigation. Visit each top-level section.
2. Within each section, test interactive elements: click buttons, fill forms, open dropdowns/modals.
3. Check edge cases: empty states, error handling, boundary inputs.
4. Try realistic end-to-end workflows (create, edit, delete flows).
5. Check the browser console for errors periodically.

At each page:

agent-browser --session {SESSION} snapshot -i
agent-browser --session {SESSION} screenshot --annotate {OUTPUT_DIR}/screenshots/{page-name}.png
agent-browser --session {SESSION} errors
agent-browser --session {SESSION} console

Use your judgment on how deep to go. Spend more time on core features and less on peripheral pages. If you find a cluster of issues in one area, investigate deeper.
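The four per-page commands are independent of each other, so they can be issued in one shell call. A minimal sketch, assuming a hypothetical `explore_page` wrapper and a hypothetical `AGENT_BROWSER` override (used only so the sequence can be dry-run with `echo`):

```shell
# explore_page SESSION OUTPUT_DIR PAGE_NAME
# Runs the per-page checks in one call: snapshot, annotated screenshot, errors, console.
explore_page() {
  local ab="${AGENT_BROWSER:-agent-browser}"   # hypothetical override for dry runs
  local session="$1" out="$2" page="$3"
  "$ab" --session "$session" snapshot -i &&
  "$ab" --session "$session" screenshot --annotate "$out/screenshots/$page.png" &&
  "$ab" --session "$session" errors &&
  "$ab" --session "$session" console
}

# Example: explore_page "$SESSION" "$OUTPUT_DIR" billing
```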

5. Document Issues (Repro-First)

Steps 4 and 5 happen together -- explore and document in a single pass. When you find an issue, stop exploring and document it immediately before moving on. Do not explore the whole app first and document later.

Every issue must be reproducible. When you find something wrong, do not just note it -- prove it with evidence. The goal is that someone reading the report can see exactly what happened and replay it.

Choose the right level of evidence for the issue:

Interactive / behavioral issues (functional, ux, console errors on action)

These require user interaction to reproduce -- use full repro with video and step-by-step screenshots:

1. Start a repro video before reproducing:

agent-browser --session {SESSION} record start {OUTPUT_DIR}/videos/issue-{NNN}-repro.webm

2. Walk through the steps at human pace. Pause 1-2 seconds between actions so the video is watchable. Take a screenshot at each step:

agent-browser --session {SESSION} screenshot {OUTPUT_DIR}/screenshots/issue-{NNN}-step-1.png

sleep 1
# Perform action (click, fill, etc.)
sleep 1
agent-browser --session {SESSION} screenshot {OUTPUT_DIR}/screenshots/issue-{NNN}-step-2.png
sleep 1
# ...continue until the issue manifests

3. Capture the broken state. Pause so the viewer can see it, then take an annotated screenshot:

sleep 2
agent-browser --session {SESSION} screenshot --annotate {OUTPUT_DIR}/screenshots/issue-{NNN}-result.png

4. Stop the video:

agent-browser --session {SESSION} record stop

5. Write numbered repro steps in the report, each referencing its screenshot.
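The screenshot-then-pause rhythm of the steps above can be captured in one helper so the pacing stays consistent across issues. This is a sketch under stated assumptions: `repro_step`, the `PACE` variable (seconds to pause, defaulting to 1), and the `AGENT_BROWSER` override are all hypothetical names, not agent-browser features.

```shell
# repro_step SESSION OUTPUT_DIR ISSUE_ID STEP
# Takes the screenshot for one repro step, then pauses so the video stays watchable.
repro_step() {
  local ab="${AGENT_BROWSER:-agent-browser}"   # hypothetical override for dry runs
  "$ab" --session "$1" screenshot "$2/screenshots/issue-$3-step-$4.png" || return 1
  sleep "${PACE:-1}"                           # hypothetical pacing knob, in seconds
}

# Example, between `record start` and `record stop`:
# repro_step "$SESSION" "$OUTPUT_DIR" 001 1
# ...perform the action (click, type, etc.)...
# repro_step "$SESSION" "$OUTPUT_DIR" 001 2
```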

Static / visible-on-load issues (typos, placeholder text, clipped text, misalignment, console errors on load)

These are visible without interaction -- a single annotated screenshot is sufficient. No video, no multi-step repro:

agent-browser --session {SESSION} screenshot --annotate {OUTPUT_DIR}/screenshots/issue-{NNN}.png

Write a brief description and reference the screenshot in the report. Set Repro Video to N/A.
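Both evidence levels follow the same file-naming scheme, which a small helper can generate. The sketch below assumes {NNN} is the issue number zero-padded to three digits, matching the ISSUE-001 style of the report; `issue_shot_path` is a hypothetical name.

```shell
# issue_shot_path OUTPUT_DIR ISSUE_NUMBER [SUFFIX]
# Prints the screenshot path for an issue. Pass the number as a plain decimal
# integer without leading zeros (e.g. 7, not 007).
issue_shot_path() {
  local id
  id=$(printf '%03d' "$2") || return 1
  if [ -n "${3:-}" ]; then
    printf '%s/screenshots/issue-%s-%s.png\n' "$1" "$id" "$3"   # interactive: step-N or result
  else
    printf '%s/screenshots/issue-%s.png\n' "$1" "$id"           # static: single screenshot
  fi
}

# Example: agent-browser --session "$SESSION" screenshot --annotate "$(issue_shot_path "$OUTPUT_DIR" 4)"
```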

For all issues:

Append to the report immediately. Do not batch issues for later. Write each one as you find it so nothing is lost if the session is interrupted.
Increment the issue counter (ISSUE-001, ISSUE-002, ...).
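Since each issue is appended to the report as it is found, the next issue ID can be derived from the report itself instead of being tracked separately. A minimal sketch, assuming every recorded issue starts with a line beginning `### ISSUE-` and that the report contains no placeholder blocks of that form; `next_issue_id` is a hypothetical helper.

```shell
# next_issue_id REPORT_FILE
# Counts existing "### ISSUE-" headings and prints the next ID, e.g. ISSUE-003.
next_issue_id() {
  local count=0
  if [ -f "$1" ]; then
    # grep -c prints 0 but exits non-zero when nothing matches, hence `|| true`.
    count=$(grep -c '^### ISSUE-' "$1" || true)
  fi
  printf 'ISSUE-%03d\n' "$((count + 1))"
}

# Example: ID=$(next_issue_id "$OUTPUT_DIR/report.md")
```

Deriving the counter from the file keeps it correct even if the session is interrupted and resumed.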

6. Wrap Up

Aim to find 5-10 well-documented issues, then wrap up. Depth of evidence matters more than total count -- 5 issues with full repro beat 20 with vague descriptions.

After exploring:

1. Re-read the report and update the summary severity counts so they match the actual issues. Every ### ISSUE- block must be reflected in the totals.
2. Close the session:

agent-browser --session {SESSION} close

3. Tell the user the report is ready and summarize findings: total issues, breakdown by severity, and the most critical items.

Guidance

Repro is everything. Every issue needs proof -- but match the evidence to the issue. Interactive bugs need video and step-by-step screenshots. Static bugs (typos, placeholder text, visual glitches visible on load) only need a single annotated screenshot.
Verify reproducibility before collecting evidence. Before recording video or taking screenshots, verify the issue is reproducible with at least one retry. If it can't be reproduced consistently, it's not a valid issue.
Don't record video for static issues. A typo or clipped text doesn't benefit from a video. Save video for issues that involve user interaction, timing, or state changes.
For interactive issues, screenshot each step. Capture the before, the action, and the after -- so someone can see the full sequence.
Write repro steps that map to screenshots. Each numbered step in the report should reference its corresponding screenshot. A reader should be able to follow the steps visually without touching a browser.
Use the right snapshot command.
- snapshot -i — for finding clickable/fillable elements (buttons, inputs, links)
- snapshot (no flag) — for reading page content (text, headings, data lists)
Be thorough but use judgment. You are not following a test script -- you are exploring like a real user would. If something feels off, investigate.
Write findings incrementally. Append each issue to the report as you discover it. If the session is interrupted, findings are preserved. Never batch all issues for the end.
Never delete output files. Do not rm screenshots, videos, or the report mid-session. Do not close the session and restart. Work forward, not backward.
Never read the target app's source code. You are testing as a user, not auditing code. Do not read HTML, JS, or config files of the app under test. All findings must come from what you observe in the browser.
Check the console. Many issues are invisible in the UI but show up as JS errors or failed requests.
Test like a user, not a robot. Try common workflows end-to-end. Click things a real user would click. Enter realistic data.
Type like a human. When filling form fields during video recording, use type instead of fill -- it types character-by-character. Use fill only outside of video recording when speed matters.
Pace repro videos for humans. Add sleep 1 between actions and sleep 2 before the final result screenshot. Videos should be watchable at 1x speed -- a human reviewing the report needs to see what happened, not a blur of instant state changes.
Be efficient with commands. Batch multiple agent-browser commands in a single shell call when they are independent (e.g., agent-browser ... screenshot ... && agent-browser ... console). Use agent-browser --session {SESSION} scroll down 300 for scrolling -- do not use key or evaluate to scroll.

References

Reference	When to Read
references/issue-taxonomy.md	Start of session -- calibrate what to look for, severity levels, exploration checklist

Templates

Template	Purpose
templates/dogfood-report-template.md	Copy into output directory as the report file

Weekly Installs: 13.7K
Repository: vercel-labs/age…-browser
GitHub Stars: 24.7K
First Seen: Feb 24, 2026
Security Audits: Gen Agent Trust Hub (Pass), Socket (Pass), Snyk (Fail)
Installed on: codex 12.9K, opencode 12.8K, gemini-cli 12.8K, cursor 12.8K, github-copilot 12.8K, kimi-cli 12.7K
