PDF转Markdown工具：自动检测原生/扫描文档，支持OCR转换

pdf-to-markdown by duc01226/easyplatform

375 周安装量

5 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/duc01226/easyplatform --skill pdf-to-markdown

内容创作自动化文件管理

🇨🇳中文介绍

[IMPORTANT] 在开始前务必使用 TaskCreate 将所有工作拆分为小任务——包括每个文件的读取任务。这可以防止因处理长文件而丢失上下文。对于简单任务，AI 必须询问用户是否跳过。

快速摘要

目标： 将 PDF 文件转换为格式良好的 Markdown，并自动检测原生文本与扫描文档。

工作流程：

自动检测 — 确定 PDF 是否包含原生文本或需要 OCR
转换 — 使用输入路径和可选的模式/输出标志运行 scripts/convert.cjs
输出 — 返回包含成功状态、页数和输出路径的 JSON

关键规则：

使用 --mode auto（默认）让工具决定使用原生模式还是 OCR 模式
扫描版 PDF 的 OCR 需要额外的 tesseract.js 设置
复杂的多栏布局可能无法完美保留结构

保持怀疑态度。运用批判性思维、顺序性思维。每个主张都需要可追溯的证据，并给出置信度百分比（想法应超过 80%）。

pdf-to-markdown

将 PDF 文件转换为 Markdown 格式，自动检测原生文本与扫描文档。

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

相关 Skills

FlyClaw：零登录航班聚合查询工具，Python实现多源航班信息与价格搜索

4,000,000 周安装

Gmail过滤器创建教程 - 使用Google Workspace CLI自动分类邮件与添加标签

6,700 周安装

Google Slides 演示文稿创建与共享自动化教程 - 使用 Google Workspace CLI

6,500 周安装

Google Calendar 会议重新安排技能 - 自动更新会议时间并通知参与者

6,300 周安装

此技能需要 npm 依赖项。 运行以下命令之一：

# 选项 1：通过 ClaudeKit CLI 安装（推荐）
ck init  # 运行 install.sh 以处理所有技能

# 选项 2：手动安装
cd .claude/skills/pdf-to-markdown
npm install

依赖项： @opendocsg/pdf2md（原生 PDF），pdfjs-dist（PDF 解析）

注意： 扫描版 PDF 的 OCR 需要额外设置（见 OCR 部分）。

# 基本转换（自动检测原生或扫描）
node .claude/skills/pdf-to-markdown/scripts/convert.cjs --input ./document.pdf

# 指定输出路径
node .claude/skills/pdf-to-markdown/scripts/convert.cjs -i ./doc.pdf -o ./output.md

# 强制原生模式（跳过 OCR 检测）
node .claude/skills/pdf-to-markdown/scripts/convert.cjs -i ./doc.pdf --mode native

选项	简写	描述	默认值
`--input`	`-i`	输入 PDF 文件路径	（必需）
`--output`	`-o`	输出 Markdown 文件路径	`{input}.md`
`--mode`	`-m`	转换模式：`auto`、`native`、`ocr`	`auto`
`--help`	`-h`	显示帮助信息

自动检测： 自动判断 PDF 是否包含原生文本或需要 OCR
原生 PDF： 使用 @opendocsg/pdf2md 进行快速提取
表格： 基本保留表格结构
跨平台： 支持 Windows、macOS、Linux
无系统依赖： 纯 JavaScript 实现

检查 PDF 第一页是否有可提取的文本。如果找到文本则使用原生提取，否则回退到 OCR 警告。

快速直接文本提取。最适合包含可选文本（非扫描图像）的 PDF。

OCR（扫描版 PDF）- 即将推出

用于扫描文档。当前未实现——如果 PDF 看起来是扫描版，技能会通知您。

成功时返回 JSON：

{
    "success": true,
    "input": "/path/to/input.pdf",
    "output": "/path/to/output.md",
    "stats": {
        "pages": 5,
        "mode": "native"
    }
}

复杂的多栏布局可能无法保留结构
扫描版 PDF 的 OCR 准确度取决于图像质量
数学公式可能无法完美转换
首次运行 OCR 会下载语言数据（约 15MB）

OCR 设置（可选）

如需支持扫描版 PDF，请安装额外的依赖项：

npm install tesseract.js pdfjs-dist canvas

注意： 在某些系统上，canvas 包可能需要构建工具。

重要任务规划说明（必须遵守）

始终规划并将工作拆分为许多小的待办任务
始终添加一个最终的审查待办任务，以验证工作质量并识别需要修复/改进的地方

🇺🇸English

[IMPORTANT] Use TaskCreate to break ALL work into small tasks BEFORE starting — including tasks for each file read. This prevents context loss from long files. For simple tasks, AI MUST ask user whether to skip.

Quick Summary

Goal: Convert PDF files to well-formatted Markdown with auto-detection of native text vs scanned documents.

Workflow:

Auto-Detect — Determine if PDF has native text or needs OCR
Convert — Run scripts/convert.cjs with input path and optional mode/output flags
Output — Returns JSON with success status, page count, and output path

Key Rules:

Use --mode auto (default) to let the tool decide native vs OCR
OCR for scanned PDFs requires additional tesseract.js setup
Complex multi-column layouts may not preserve structure perfectly

Be skeptical. Apply critical thinking, sequential thinking. Every claim needs traced proof, confidence percentages (Idea should be more than 80%).

pdf-to-markdown

Convert PDF files to Markdown format with automatic detection of native text vs scanned documents.

Installation Required

This skill requires npm dependencies. Run one of the following:

# Option 1: Install via ClaudeKit CLI (recommended)
ck init  # Runs install.sh which handles all skills

# Option 2: Manual installation
cd .claude/skills/pdf-to-markdown
npm install

Dependencies: @opendocsg/pdf2md (native PDFs), pdfjs-dist (PDF parsing)

Note: OCR for scanned PDFs requires additional setup (see OCR section).

Quick Start

# Basic conversion (auto-detect native vs scanned)
node .claude/skills/pdf-to-markdown/scripts/convert.cjs --input ./document.pdf

# Specify output path
node .claude/skills/pdf-to-markdown/scripts/convert.cjs -i ./doc.pdf -o ./output.md

# Force native mode (skip OCR detection)
node .claude/skills/pdf-to-markdown/scripts/convert.cjs -i ./doc.pdf --mode native

CLI Options

Option	Short	Description	Default
`--input`	`-i`	Input PDF file path	(required)
`--output`	`-o`	Output markdown file path	`{input}.md`
`--mode`	`-m`

Features

Auto-Detection: Automatically determines if PDF has native text or requires OCR
Native PDFs: Fast extraction using @opendocsg/pdf2md
Tables: Basic table structure preservation
Cross-Platform: Works on Windows, macOS, Linux
No System Dependencies: Pure JavaScript implementation

Conversion Modes

Auto (Default)

Checks if PDF has extractable text on first page. Uses native extraction if text found, otherwise falls back to OCR warning.

Native

Fast direct text extraction. Best for PDFs with selectable text (not scanned images).

OCR (Scanned PDFs) - Coming Soon

For scanned documents. Currently not implemented - the skill will notify you if a PDF appears to be scanned.

Output

Returns JSON on success:

{
    "success": true,
    "input": "/path/to/input.pdf",
    "output": "/path/to/output.md",
    "stats": {
        "pages": 5,
        "mode": "native"
    }
}

Limitations

Complex multi-column layouts may not preserve structure
Scanned PDF OCR accuracy depends on image quality
Mathematical formulas may not convert perfectly
First-run OCR downloads language data (~15MB)

OCR Setup (Optional)

For scanned PDF support, install additional dependencies:

npm install tesseract.js pdfjs-dist canvas

Note: The canvas package may require build tools on some systems.

IMPORTANT Task Planning Notes (MUST FOLLOW)

Always plan and break work into many small todo tasks
Always add a final review todo task to verify work quality and identify fixes/enhancements

Weekly Installs

361

Repository

duc01226/easyplatform

GitHub Stars

First Seen

Jan 24, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykPass

Installed on

opencode331

codex328

gemini-cli325

github-copilot317

cursor311

amp303

Google Workspace CLI 团队负责人技能：自动化站会、任务协调与团队沟通工具

6,300 周安装

PDF转Markdown工具：自动检测原生/扫描文档，支持OCR转换

🇨🇳中文介绍

快速摘要

pdf-to-markdown

相关 Skills

安装要求

快速开始

CLI 选项

功能特性

转换模式

自动（默认）

原生

OCR（扫描版 PDF）- 即将推出

输出

限制

OCR 设置（可选）

🇺🇸English

Quick Summary

pdf-to-markdown

Installation Required

Quick Start

CLI Options

Features

Conversion Modes

Auto (Default)

Native

OCR (Scanned PDFs) - Coming Soon

Output

Limitations

OCR Setup (Optional)

最新 Skills