browser by browserbase/skills
npx skills add https://github.com/browserbase/skills --skill browser
Automate browser interactions using the browse CLI with Claude.
Before running any browser commands, verify the CLI is available:
which browse || npm install -g @browserbasehq/browse-cli
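In a script, the same check can be wrapped in a small function. This is a minimal sketch: the name ensure_browse is made up for illustration, and it uses command -v rather than which because command -v is the POSIX-specified lookup.

```shell
# ensure_browse: hypothetical helper, not part of the browse CLI.
# Installs the CLI globally only when no `browse` command is found.
ensure_browse() {
  if command -v browse >/dev/null 2>&1; then
    return 0
  fi
  npm install -g @browserbasehq/browse-cli
}
```

Calling ensure_browse once at the top of a script keeps later browse commands from failing with "command not found".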
The CLI automatically selects between local and remote browser environments based on available configuration:
Remote mode (Browserbase) is activated when BROWSERBASE_API_KEY and BROWSERBASE_PROJECT_ID are set; otherwise the CLI uses local Chrome.
All commands work identically in both modes. The daemon auto-starts on the first command.
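To predict which environment a fresh daemon is likely to pick, a script can apply the same rule to its own environment variables. The sketch below is an assumption-based mirror of the rule described here, not the CLI's actual logic; expected_browse_env is a hypothetical name, and treating an empty variable as unset is a guess.

```shell
# expected_browse_env: hypothetical helper that mirrors the documented rule.
# Prints "remote" when both Browserbase variables are non-empty, else "local".
expected_browse_env() {
  if [ -n "${BROWSERBASE_API_KEY:-}" ] && [ -n "${BROWSERBASE_PROJECT_ID:-}" ]; then
    echo "remote"
  else
    echo "local"
  fi
}
```

The authoritative answer still comes from browse env, which reports the environment actually in use.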
browse open <url> # Go to URL (alias: goto)
browse reload # Reload current page
browse back # Go back in history
browse forward # Go forward in history
browse snapshot # Get accessibility tree with element refs (fast, structured)
browse screenshot [path] # Take visual screenshot (slow, uses vision tokens)
browse get url # Get current URL
browse get title # Get page title
browse get text <selector> # Get text content (use "body" for all text)
browse get html <selector> # Get HTML content of element
browse get value <selector> # Get form field value
Use browse snapshot as your default for understanding page state — it returns the accessibility tree with element refs you can use to interact. Only use browse screenshot when you need visual context (layout, images, debugging).
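When a snapshot is long, it can help to pull out just the element refs. The helper below is a sketch under two assumptions: that refs appear in the snapshot text in the @0-5 form used in this document, and that browse snapshot writes the tree to standard output. The name list_refs is hypothetical, and grep -o is a common extension rather than strict POSIX.

```shell
# list_refs: hypothetical helper. Reads snapshot text on stdin and prints
# each distinct token shaped like @0-5, one per line.
# Assumption: refs appear in the snapshot text in that @N-M form.
list_refs() {
  grep -oE '@[0-9]+-[0-9]+' | sort -u
}
```

Under those assumptions, browse snapshot | list_refs would list the refs available for browse click.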
browse click <ref> # Click element by ref from snapshot (e.g., @0-5)
browse type <text> # Type text into focused element
browse fill <selector> <value> # Fill input and press Enter
browse select <selector> <values...> # Select dropdown option(s)
browse press <key> # Press key (Enter, Tab, Escape, Cmd+A, etc.)
browse drag <fromX> <fromY> <toX> <toY> # Drag from one point to another
browse scroll <x> <y> <deltaX> <deltaY> # Scroll at coordinates
browse highlight <selector> # Highlight element on page
browse is visible <selector> # Check if element is visible
browse is checked <selector> # Check if element is checked
browse wait <type> [arg] # Wait for: load, selector, timeout
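Several of these commands can be chained into one step. The sketch below submits text through a single input; submit_query is a hypothetical name, the selector is supplied by the caller, and the literal load argument to browse wait is assumed from the wait types listed above.

```shell
# submit_query: hypothetical helper. Opens a page, fills one input (fill also
# presses Enter, per the command list), waits for load, prints the new title.
# $1 = URL, $2 = selector of the input, $3 = text to submit.
submit_query() {
  browse open "$1" &&
    browse fill "$2" "$3" &&
    browse wait load &&
    browse get title
}
```

Because the steps are joined with &&, the function stops at the first failing command and returns its exit status.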
browse stop # Stop the browser daemon
browse status # Check daemon status (includes env)
browse env # Show current environment (local or remote)
browse env local # Switch to local Chrome
browse env remote # Switch to Browserbase (requires API keys)
browse pages # List all open tabs
browse tab_switch <index> # Switch to tab by index
browse tab_close [index] # Close tab
1. browse open <url>: navigate to the page
2. browse snapshot: read the accessibility tree to understand page structure and get element refs
3. browse click <ref> / browse type <text> / browse fill <selector> <value>: interact using refs from snapshot
4. browse snapshot: confirm the action worked
5. browse stop: close the browser when done
browse open https://example.com
browse snapshot # see page structure + element refs
browse click @0-5 # click element with ref 0-5
browse get title
browse stop
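In a script, it is easy to skip the final browse stop when an earlier step fails. The wrapper below is a minimal sketch that always runs the cleanup; run_session is a hypothetical name, and the set of steps is only an example.

```shell
# run_session: hypothetical wrapper. Opens a URL, prints a snapshot, and
# always runs `browse stop` afterwards, even when an earlier step fails.
# Returns the status of the open/snapshot steps, not of the cleanup.
run_session() {
  rc=0
  browse open "$1" && browse snapshot || rc=$?
  browse stop
  return "$rc"
}
```

For example, run_session https://example.com prints the snapshot and leaves no daemon behind.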
| Feature | Local | Browserbase |
|---|---|---|
| Speed | Faster | Slightly slower |
| Setup | Chrome required | API key required |
| Stealth mode | No | Yes (custom Chromium, anti-bot fingerprinting) |
| CAPTCHA solving | No | Yes (automatic reCAPTCHA/hCaptcha) |
| Residential proxies | No | Yes (201 countries, geo-targeting) |
| Session persistence | No | Yes (cookies/auth persist across sessions) |
| Best for | Development/simple pages | Protected sites, bot detection, production scraping |
- Run browse open first before interacting.
- Use browse snapshot to check page state; it's fast and gives you element refs.
- Interact through refs taken from the snapshot, e.g. browse click @0-5.
- Run browse stop when done to clean up the browser session.
- If the daemon is stuck: run browse stop, then check browse status. If it still says running, kill the zombie daemon with pkill -f "browse.*daemon", then retry browse open.
- If local Chrome is unavailable: use browse env remote.
- If an interaction fails: run browse snapshot to see available elements and their refs.
Switch to remote when you detect: CAPTCHAs (reCAPTCHA, hCaptcha, Turnstile), bot detection pages ("Checking your browser..."), HTTP 403/429, empty pages on sites that should have content, or the user asks for it.
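The stop, status, pkill recovery sequence can be scripted. This is a sketch, not an official recovery procedure: reset_browse_daemon is a hypothetical name, and the check assumes the browse status output contains the lowercase word "running" while the daemon is alive and "not running" otherwise.

```shell
# reset_browse_daemon: hypothetical helper for the stuck-daemon recovery.
# Assumption: `browse status` prints "running" while the daemon is alive;
# adjust the patterns if the real output differs.
reset_browse_daemon() {
  browse stop
  st=$(browse status 2>/dev/null)
  case "$st" in
    *"not running"*) ;;                        # already stopped, nothing to kill
    *running*) pkill -f 'browse.*daemon' ;;    # zombie daemon survived the stop
  esac
  return 0
}
```

After it returns, browse open can be retried and the daemon should start fresh.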
Don't switch for simple sites (docs, wikis, public APIs, localhost).
browse env remote # switch to Browserbase
browse env local # switch back to local Chrome
The switch is sticky until you run browse stop or switch again. If API keys aren't set:
openclaw browserbase setup # interactive — prompts for API key + project ID
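The switching signals listed above can be expressed as a simple check. The sketch below is illustrative only: needs_remote is a hypothetical name, the HTTP status code must come from the caller since this document lists no command that reports it, and treating empty page text as a signal assumes the site should have had content.

```shell
# needs_remote: hypothetical helper. $1 = HTTP status code, $2 = page text.
# Returns 0 (true) when the inputs match one of the listed signals.
needs_remote() {
  case "$1" in
    403|429) return 0 ;;
  esac
  case "$2" in
    "") return 0 ;;   # empty page where content was expected
    *[Rr]eCAPTCHA*|*hCaptcha*|*Turnstile*|*"Checking your browser"*) return 0 ;;
  esac
  return 1
}
```

One possible use is needs_remote "$code" "$(browse get text body)" && browse env remote, where $code is a status the caller obtained separately.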
For detailed examples, see EXAMPLES.md. For API reference, see REFERENCE.md.
Weekly Installs: 139
Repository: browserbase/skills
GitHub Stars: 452
First Seen: Feb 4, 2026
Security Audits: Gen Agent Trust Hub (Pass), Socket (Pass), Snyk (Warn)
Installed on: codex (127), opencode (122), gemini-cli (121), github-copilot (120), kimi-cli (118), amp (118)