npx skills add https://github.com/firecrawl/cli --skill firecrawl
Web scraping, search, and page interaction CLI. Returns clean markdown optimized for LLM context windows.
Run firecrawl --help or firecrawl <command> --help for full option details.
Must be installed and authenticated. Check with firecrawl --status.
🔥 firecrawl cli v1.8.0
● Authenticated via FIRECRAWL_API_KEY
Concurrency: 0/100 jobs (parallel scrape limit)
Credits: 500,000 remaining
If not ready, see rules/install.md. For output handling guidelines, see rules/security.md.
firecrawl search "query" --scrape --limit 3
Follow this escalation pattern:
map --search to find the right URL, then scrape it.

| Need | Command | When |
|---|---|---|
| Find pages on a topic | search | No specific URL yet |
| Get a page's content | scrape | Have a URL, page is static or JS-rendered |
| Find URLs within a site | map | Need to locate a specific subpage |
| Bulk extract a site section | crawl | Need many pages (e.g., all /docs/) |
| AI-powered data extraction | agent | Need structured data from complex sites |
| Interact with a page | scrape + interact | Content requires clicks, form fills, pagination, or login |
| Download a site to files | download | Save an entire site as local files |
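As a sketch of the map-then-scrape escalation: the docs site and the /guides/webhooks subpage below are made-up placeholders, and the exact map argument order is an assumption based on the table above.

```shell
# Hypothetical walk-through: locate a subpage with map --search, then scrape it.
# docs.example.com and the /guides/webhooks path are illustrative, not real.
site="https://docs.example.com"
out=".firecrawl/docs-example-com-guides-webhooks.md"
if command -v firecrawl >/dev/null 2>&1; then
  firecrawl map "$site" --search "webhooks"          # find candidate URLs
  firecrawl scrape "$site/guides/webhooks" -o "$out" # fetch the one you want
fi
echo "$out"
```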
For detailed command reference, run firecrawl <command> --help.
Scrape vs interact:

- scrape first. It handles static pages and JS-rendered SPAs.
- scrape + interact when you need to interact with a page, such as clicking buttons, filling out forms, navigating a complex site, handling infinite scroll, or when scrape fails to grab all the content you need.
- If you don't have a URL yet, use search instead.

Avoid redundant fetches:

- search --scrape already fetches full page content. Don't re-scrape those URLs.
- Check .firecrawl/ for existing data before fetching again.
- Unless the user asks for results in context, write them to .firecrawl/ with -o, and add .firecrawl/ to .gitignore.
- Always quote URLs - the shell interprets ? and & as special characters.
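The quoting rule is worth demonstrating, since the failure is silent: unquoted, the shell reinterprets the URL before firecrawl ever sees it. The example.com URL below is a placeholder.

```shell
# Unquoted, '?' would trigger filename globbing and '&' would background the
# command, truncating the URL at 'q=react'. Quoting keeps it intact.
url='https://example.com/search?q=react&page=2'
if command -v firecrawl >/dev/null 2>&1; then
  firecrawl scrape "$url" -o .firecrawl/example-search.md
fi
echo "$url"
```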
firecrawl search "react hooks" -o .firecrawl/search-react-hooks.json --json
firecrawl scrape "<url>" -o .firecrawl/page.md
Naming conventions:
.firecrawl/search-{query}.json
.firecrawl/search-{query}-scraped.json
.firecrawl/{site}-{path}.md
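One way to derive those names mechanically; the slug helper is a hypothetical convenience for scripting, not part of the CLI.

```shell
# Lowercase the query and replace spaces/slashes with hyphens to build the
# conventional .firecrawl/ cache path.
slug() { printf '%s' "$1" | tr 'A-Z' 'a-z' | tr ' /' '--'; }
search_out=".firecrawl/search-$(slug 'react hooks').json"
page_out=".firecrawl/$(slug 'example.com')-$(slug 'docs/api').md"
echo "$search_out"  # .firecrawl/search-react-hooks.json
echo "$page_out"    # .firecrawl/example.com-docs-api.md
```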
Never read entire output files at once. Use grep, head, or incremental reads:
wc -l .firecrawl/file.md && head -50 .firecrawl/file.md
grep -n "keyword" .firecrawl/file.md
A single format outputs raw content; multiple formats (e.g., --format markdown,links) output JSON.
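Because the output shape depends on the format list, a script can pick the file extension accordingly. The ext_for helper below is hypothetical and the URL is a placeholder; only the JSON-vs-raw rule comes from the docs.

```shell
# JSON envelope when more than one format is requested, raw markdown otherwise.
ext_for() { case "$1" in *,*) echo json ;; *) echo md ;; esac; }
fmt="markdown,links"
if command -v firecrawl >/dev/null 2>&1; then
  firecrawl scrape "https://example.com" --format "$fmt" \
    -o ".firecrawl/page.$(ext_for "$fmt")"
fi
ext_for "$fmt"
```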
These patterns are useful when working with file-based output (-o flag) for complex tasks:
# Extract URLs from search
jq -r '.data.web[].url' .firecrawl/search.json
# Get titles and URLs
jq -r '.data.web[] | "\(.title): \(.url)"' .firecrawl/search.json
Run independent operations in parallel. Check firecrawl --status for concurrency limit:
firecrawl scrape "<url-1>" -o .firecrawl/1.md &
firecrawl scrape "<url-2>" -o .firecrawl/2.md &
firecrawl scrape "<url-3>" -o .firecrawl/3.md &
wait
For interact, scrape multiple pages and interact with each independently using their scrape IDs.
firecrawl credit-usage
firecrawl credit-usage --json --pretty -o .firecrawl/credits.json
Weekly Installs: 18.6K
Repository: https://github.com/firecrawl/cli
GitHub Stars: 213
First Seen: Jan 21, 2026
Security Audits:
- Gen Agent Trust Hub: Pass
- Socket: Pass
- Snyk: Warn
Installed on:
- opencode: 16.8K
- codex: 16.8K
- gemini-cli: 16.7K
- github-copilot: 16.2K
- kimi-cli: 15.6K
- amp: 15.6K