Agent Browser by am-will/codex-skills
npx skills add https://github.com/am-will/codex-skills --skill 'Agent Browser'一个基于 Rust 的快速无头浏览器自动化 CLI,支持 Node.js 回退,使 AI 代理能够通过结构化命令导航、点击、输入和捕获页面快照。
npm install -g agent-browser
agent-browser install
agent-browser install --with-deps
git clone https://github.com/vercel-labs/agent-browser
cd agent-browser
pnpm install
pnpm build
agent-browser install
agent-browser open example.com
agent-browser snapshot
agent-browser click @e2
agent-browser fill @e3 "test@example.com"
agent-browser get text @e1
agent-browser screenshot page.png
agent-browser close
对于需要 Google/Discord 等登录的网站(如 star-swap.com):
方法 1:使用自定义配置文件启动 Chrome,通过 CDP 连接
# 终端 1:使用您的真实配置文件和远程调试启动 Chrome
google-chrome --remote-debugging-port=9222 --user-data-dir=/home/willr/.config/google-chrome/Default &
# 终端 2:将 agent-browser 连接到该 Chrome 实例
agent-browser --cdp 9222 open "https://star-swap.com"
agent-browser --cdp 9222 snapshot -i
agent-browser --cdp 9222 click e2
# 这会复用您现有的 Google 会话 - 无需重新登录!
# 适用于:Google OAuth、Discord OAuth 以及您在 Chrome 中已登录的任何网站
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
方法 2:会话持久化(首次手动登录)
# 首次:使用有头模式,手动登录
agent-browser --headed --session starswap open "https://star-swap.com"
# 在浏览器窗口中手动完成 Google OAuth
# 完成后关闭
# 后续运行:Cookie 会持久保存!
agent-browser --session starswap open "https://star-swap.com"
# 已自动登录
am.will.ryan Chrome 配置文件: /home/willr/.config/google-chrome/Default
agent-browser open <url>
agent-browser back
agent-browser forward
agent-browser reload
agent-browser click <sel>
agent-browser dblclick <sel>
agent-browser focus <sel>
agent-browser type <sel> <text>
agent-browser fill <sel> <text>
agent-browser clear <sel>
agent-browser press <key>
agent-browser keydown <key>
agent-browser keyup <key>
agent-browser hover <sel>
agent-browser select <sel> <val>
agent-browser check <sel>
agent-browser uncheck <sel>
agent-browser drag <src> <tgt>
agent-browser upload <sel> <files>
agent-browser snapshot
agent-browser get text <sel>
agent-browser get html <sel>
agent-browser get value <sel>
agent-browser get attr <sel> <attr>
agent-browser get title
agent-browser get url
agent-browser get count <sel>
agent-browser get box <sel>
agent-browser screenshot [path]
agent-browser pdf <path>
agent-browser is visible <sel>
agent-browser is enabled <sel>
agent-browser is checked <sel>
操作包括 click、fill、check、hover 和 text。
agent-browser wait <selector>
agent-browser wait <ms>
agent-browser wait --text "Welcome"
agent-browser wait --url "**/dash"
agent-browser wait --load networkidle
agent-browser scroll <dir> [px]
agent-browser scrollintoview <sel>
agent-browser eval <js>
agent-browser mouse move <x> <y>
agent-browser cookies
agent-browser storage local
agent-browser tab new [url]
agent-browser frame <sel>
agent-browser dialog accept [text]
运行多个隔离的浏览器实例。
agent-browser --session agent1 open site-a.com
agent-browser --session agent2 open site-b.com
快照命令支持过滤以减少输出大小。
引用提供从快照中确定性选择元素的功能。使用 @ref 语法。
agent-browser snapshot
agent-browser click @e2
使用 --json 获取机器可读的输出。
agent-browser snapshot --json
每周安装次数
0
代码仓库
GitHub 星标数
504
首次出现时间
1970年1月1日
安全审计
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
npm install -g agent-browser
agent-browser install
agent-browser install --with-deps
git clone https://github.com/vercel-labs/agent-browser
cd agent-browser
pnpm install
pnpm build
agent-browser install
agent-browser open example.com
agent-browser snapshot
agent-browser click @e2
agent-browser fill @e3 "test@example.com"
agent-browser get text @e1
agent-browser screenshot page.png
agent-browser close
For sites requiring Google/Discord/etc login (like star-swap.com):
Method 1: Launch Chrome with custom profile, connect via CDP
# Terminal 1: Launch Chrome with your real profile and remote debugging
google-chrome --remote-debugging-port=9222 --user-data-dir=/home/willr/.config/google-chrome/Default &
# Terminal 2: Connect agent-browser to that Chrome instance
agent-browser --cdp 9222 open "https://star-swap.com"
agent-browser --cdp 9222 snapshot -i
agent-browser --cdp 9222 click e2
# This reuses your existing Google session - no re-login needed!
# Works for: Google OAuth, Discord OAuth, any site you're logged into in Chrome
Method 2: Session persistence (first-time manual login)
# First time: headed mode, login manually
agent-browser --headed --session starswap open "https://star-swap.com"
# Complete Google OAuth manually in the browser window
# Close when done
# Future runs: cookies persist!
agent-browser --session starswap open "https://star-swap.com"
# Already logged in automatically
am.will.ryan Chrome profile: /home/willr/.config/google-chrome/Default
agent-browser open <url>
agent-browser back
agent-browser forward
agent-browser reload
agent-browser click <sel>
agent-browser dblclick <sel>
agent-browser focus <sel>
agent-browser type <sel> <text>
agent-browser fill <sel> <text>
agent-browser clear <sel>
agent-browser press <key>
agent-browser keydown <key>
agent-browser keyup <key>
agent-browser hover <sel>
agent-browser select <sel> <val>
agent-browser check <sel>
agent-browser uncheck <sel>
agent-browser drag <src> <tgt>
agent-browser upload <sel> <files>
agent-browser snapshot
agent-browser get text <sel>
agent-browser get html <sel>
agent-browser get value <sel>
agent-browser get attr <sel> <attr>
agent-browser get title
agent-browser get url
agent-browser get count <sel>
agent-browser get box <sel>
agent-browser screenshot [path]
agent-browser pdf <path>
agent-browser is visible <sel>
agent-browser is enabled <sel>
agent-browser is checked <sel>
Actions include click, fill, check, hover, and text.
agent-browser wait <selector>
agent-browser wait <ms>
agent-browser wait --text "Welcome"
agent-browser wait --url "**/dash"
agent-browser wait --load networkidle
agent-browser scroll <dir> [px]
agent-browser scrollintoview <sel>
agent-browser eval <js>
agent-browser mouse move <x> <y>
agent-browser cookies
agent-browser storage local
agent-browser tab new [url]
agent-browser frame <sel>
agent-browser dialog accept [text]
Run multiple isolated browser instances.
agent-browser --session agent1 open site-a.com
agent-browser --session agent2 open site-b.com
The snapshot command supports filtering to reduce output size.
Refs provide deterministic element selection from snapshots. Use the @ref syntax.
agent-browser snapshot
agent-browser click @e2
Use --json for machine readable output.
agent-browser snapshot --json
Weekly Installs
0
Repository
GitHub Stars
504
First Seen
Jan 1, 1970
Security Audits
通过 LiteLLM 代理让 Claude Code 对接 GitHub Copilot 运行 | 高级变通方案指南
31,600 周安装