firecrawl-scrape by firecrawl/cli
npx skills add https://github.com/firecrawl/cli --skill firecrawl-scrape抓取一个或多个 URL。返回经过清理、针对 LLM 优化的 Markdown 内容。多个 URL 会并发抓取。
# 基本 Markdown 提取
firecrawl scrape "<url>" -o .firecrawl/page.md
# 仅主要内容,无导航/页脚
firecrawl scrape "<url>" --only-main-content -o .firecrawl/page.md
# 等待 JS 渲染,然后抓取
firecrawl scrape "<url>" --wait-for 3000 -o .firecrawl/page.md
# 多个 URL(每个保存到 .firecrawl/)
firecrawl scrape https://example.com https://example.com/blog https://example.com/docs
# 同时获取 Markdown 和链接
firecrawl scrape "<url>" --format markdown,links -o .firecrawl/page.json
# 询问关于页面的问题
firecrawl scrape "https://example.com/pricing" --query "企业版计划的价格是多少?"
| 选项 | 描述 |
|---|---|
-f, --format <formats> |
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
| 输出格式:markdown, html, rawHtml, links, screenshot, json |
-Q, --query <prompt> | 询问关于页面内容的问题(消耗 5 积分) |
-H | 在输出中包含 HTTP 头信息 |
--only-main-content | 剥离导航、页脚、侧边栏 — 仅保留主要内容 |
--wait-for <ms> | 在抓取前等待 JS 渲染 |
--include-tags <tags> | 仅包含这些 HTML 标签 |
--exclude-tags <tags> | 排除这些 HTML 标签 |
-o, --output <path> | 输出文件路径 |
--query。 将内容抓取到文件,然后使用 grep、head 或直接读取 Markdown — 您可以自己搜索和推理完整内容。仅当您想要一个针对性的答案且不保存页面时(额外消耗 5 积分)才使用 --query。interact。firecrawl --status 了解您的并发限制。--format markdown,links)输出 JSON。? 和 & 解释为特殊字符。.firecrawl/{site}-{path}.mdinteract 进行点击、填写表单等操作每周安装量
6.2K
代码仓库
GitHub 星标数
205
首次出现
14 天前
安全审计
安装于
codex6.1K
opencode6.1K
github-copilot6.1K
gemini-cli6.1K
cursor6.1K
cline6.1K
Scrape one or more URLs. Returns clean, LLM-optimized markdown. Multiple URLs are scraped concurrently.
# Basic markdown extraction
firecrawl scrape "<url>" -o .firecrawl/page.md
# Main content only, no nav/footer
firecrawl scrape "<url>" --only-main-content -o .firecrawl/page.md
# Wait for JS to render, then scrape
firecrawl scrape "<url>" --wait-for 3000 -o .firecrawl/page.md
# Multiple URLs (each saved to .firecrawl/)
firecrawl scrape https://example.com https://example.com/blog https://example.com/docs
# Get markdown and links together
firecrawl scrape "<url>" --format markdown,links -o .firecrawl/page.json
# Ask a question about the page
firecrawl scrape "https://example.com/pricing" --query "What is the enterprise plan price?"
| Option | Description |
|---|---|
-f, --format <formats> | Output formats: markdown, html, rawHtml, links, screenshot, json |
-Q, --query <prompt> | Ask a question about the page content (5 credits) |
-H | Include HTTP headers in output |
--only-main-content | Strip nav, footer, sidebar — main content only |
--wait-for <ms> | Wait for JS rendering before scraping |
--include-tags <tags> | Only include these HTML tags |
--exclude-tags <tags> |
--query. Scrape to a file, then use grep, head, or read the markdown directly — you can search and reason over the full content yourself. Use --query only when you want a single targeted answer without saving the page (costs 5 extra credits).interact when you need interaction (clicks, form fills, pagination).firecrawl --status for your concurrency limit.--format markdown,links) output JSON.? and & as special characters.interact to click, fill forms, etc.Weekly Installs
6.2K
Repository
GitHub Stars
205
First Seen
14 days ago
Security Audits
Gen Agent Trust HubPassSocketPassSnykWarn
Installed on
codex6.1K
opencode6.1K
github-copilot6.1K
gemini-cli6.1K
cursor6.1K
cline6.1K
97,600 周安装
| Exclude these HTML tags |
-o, --output <path> | Output file path |
.firecrawl/{site}-{path}.md