Firecrawl Scrape：智能网页抓取工具，一键提取LLM优化Markdown内容

firecrawl-scrape by firecrawl/cli

6,300 周安装量

205 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/firecrawl/cli --skill firecrawl-scrape

AI/机器学习开发数据分析

🇨🇳中文介绍

firecrawl scrape

抓取一个或多个 URL。返回经过清理、针对 LLM 优化的 Markdown 内容。多个 URL 会并发抓取。

使用时机

您有一个特定的 URL 并希望获取其内容
页面是静态的或由 JavaScript 渲染（SPA）
工作流程升级模式中的第二步：搜索 → 抓取 → 映射 → 爬取 → 交互

快速开始

# 基本 Markdown 提取
firecrawl scrape "<url>" -o .firecrawl/page.md

# 仅主要内容，无导航/页脚
firecrawl scrape "<url>" --only-main-content -o .firecrawl/page.md

# 等待 JS 渲染，然后抓取
firecrawl scrape "<url>" --wait-for 3000 -o .firecrawl/page.md

# 多个 URL（每个保存到 .firecrawl/）
firecrawl scrape https://example.com https://example.com/blog https://example.com/docs

# 同时获取 Markdown 和链接
firecrawl scrape "<url>" --format markdown,links -o .firecrawl/page.json

# 询问关于页面的问题
firecrawl scrape "https://example.com/pricing" --query "企业版计划的价格是多少？"

选项

选项	描述
`-f, --format <formats>`

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

🇺🇸English

firecrawl scrape

Scrape one or more URLs. Returns clean, LLM-optimized markdown. Multiple URLs are scraped concurrently.

When to use

You have a specific URL and want its content
The page is static or JS-rendered (SPA)
Step 2 in the workflow escalation pattern: search → scrape → map → crawl → interact

Quick start

# Basic markdown extraction
firecrawl scrape "<url>" -o .firecrawl/page.md

# Main content only, no nav/footer
firecrawl scrape "<url>" --only-main-content -o .firecrawl/page.md

# Wait for JS to render, then scrape
firecrawl scrape "<url>" --wait-for 3000 -o .firecrawl/page.md

# Multiple URLs (each saved to .firecrawl/)
firecrawl scrape https://example.com https://example.com/blog https://example.com/docs

# Get markdown and links together
firecrawl scrape "<url>" --format markdown,links -o .firecrawl/page.json

# Ask a question about the page
firecrawl scrape "https://example.com/pricing" --query "What is the enterprise plan price?"

Options

Option	Description
`-f, --format <formats>`	Output formats: markdown, html, rawHtml, links, screenshot, json
`-Q, --query <prompt>`	Ask a question about the page content (5 credits)
`-H`	Include HTTP headers in output
`--only-main-content`	Strip nav, footer, sidebar — main content only
`--wait-for <ms>`	Wait for JS rendering before scraping
`--include-tags <tags>`	Only include these HTML tags
`--exclude-tags <tags>`

Tips

Prefer plain scrape over--query. Scrape to a file, then use grep, head, or read the markdown directly — you can search and reason over the full content yourself. Use --query only when you want a single targeted answer without saving the page (costs 5 extra credits).
Try scrape before interact. Scrape handles static pages and JS-rendered SPAs. Only escalate to interact when you need interaction (clicks, form fills, pagination).
Multiple URLs are scraped concurrently — check firecrawl --status for your concurrency limit.
Single format outputs raw content. Multiple formats (e.g., --format markdown,links) output JSON.
Always quote URLs — shell interprets ? and & as special characters.

Firecrawl Scrape：智能网页抓取工具，一键提取LLM优化Markdown内容

🇨🇳中文介绍

firecrawl scrape

使用时机

快速开始

选项

相关 Skills

提示

另请参阅

🇺🇸English

firecrawl scrape

When to use

Quick start

Options

Tips

See also