重要前提
安装AI Skills的关键前提是:必须科学上网,且开启TUN模式,这一点至关重要,直接决定安装能否顺利完成,在此郑重提醒三遍:科学上网,科学上网,科学上网。查看完整安装教程 →
image-to-text by pascalorg/skills
npx skills add https://github.com/pascalorg/skills --skill image-to-text使用 OCR(Tesseract)从图像中提取所有可读文本。返回完整的文本内容以及单词级别的边界框和置信度分数。
bash <skill-path>/scripts/image-to-text.sh <image-path> [language]
参数:
image-path — 图像文件的路径(必需)language — OCR 语言代码(可选,默认为 eng)。常见代码:eng、fra、、、、广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
deuspachi_simjpn示例:
# 从屏幕截图中提取文本
bash <skill-path>/scripts/image-to-text.sh ./screenshot.png
# 提取法语文本
bash <skill-path>/scripts/image-to-text.sh ./mockup.png fra
{
"text": "Request work\nSuggestions\nPlumbing\nHVAC\nCleaning\nElectrical",
"confidence": 87.4,
"words": [
{
"text": "Request",
"confidence": 94.2,
"bbox": { "x0": 142, "y0": 180, "x1": 268, "y1": 204 }
},
{
"text": "work",
"confidence": 96.1,
"bbox": { "x0": 274, "y0": 180, "x1": 332, "y1": 204 }
}
],
"lines": [
{
"text": "Request work",
"confidence": 95.1,
"bbox": { "x0": 142, "y0": 180, "x1": 332, "y1": 204 }
}
]
}
| 字段 | 类型 | 描述 |
|---|---|---|
| text | String | 提取的完整文本,以换行符分隔 |
| confidence | Number | 整体置信度分数(0-100) |
| words | Array | 每个单词包含文本、置信度和边界框 |
| lines | Array | 每行包含文本、置信度和边界框 |
提取文本后,按行分组呈现内容:
提取的文本(置信度 87.4%):
Request work
Suggestions
Plumbing
HVAC
Cleaning
Electrical
找到 6 行,6 个单词。
在根据设计实现 UI 文案时,可以直接使用提取的文本。
置信度低 / 文本乱码 — Tesseract 在处理干净、高对比度的文本时效果最佳。渲染后的 UI 屏幕截图效果很好。角度倾斜或有噪点的文本照片可能导致结果不佳。
语言错误 — 将正确的语言代码作为第二个参数传递。Tesseract 需要正确的语言模型来识别字符。
首次运行较慢 — Tesseract 在首次运行时需要下载语言数据(英语约 4MB)。后续运行会更快。
每周安装次数
54
代码仓库
GitHub 星标数
17
首次出现
Mar 6, 2026
安全审计
安装于
kimi-cli54
gemini-cli54
github-copilot54
amp54
cline54
codex54
Extract all readable text from an image using OCR (Tesseract). Returns the full text content along with word-level bounding boxes and confidence scores.
bash <skill-path>/scripts/image-to-text.sh <image-path> [language]
Arguments:
image-path — Path to the image file (required)language — OCR language code (optional, defaults to eng). Common: eng, fra, deu, spa, chi_sim, jpnExamples:
# Extract text from a screenshot
bash <skill-path>/scripts/image-to-text.sh ./screenshot.png
# Extract French text
bash <skill-path>/scripts/image-to-text.sh ./mockup.png fra
{
"text": "Request work\nSuggestions\nPlumbing\nHVAC\nCleaning\nElectrical",
"confidence": 87.4,
"words": [
{
"text": "Request",
"confidence": 94.2,
"bbox": { "x0": 142, "y0": 180, "x1": 268, "y1": 204 }
},
{
"text": "work",
"confidence": 96.1,
"bbox": { "x0": 274, "y0": 180, "x1": 332, "y1": 204 }
}
],
"lines": [
{
"text": "Request work",
"confidence": 95.1,
"bbox": { "x0": 142, "y0": 180, "x1": 332, "y1": 204 }
}
]
}
| Field | Type | Description |
|---|---|---|
| text | String | Full extracted text, newline-separated |
| confidence | Number | Overall confidence score (0-100) |
| words | Array | Each word with text, confidence, and bounding box |
| lines | Array | Each line with text, confidence, and bounding box |
After extracting text, present the content grouped by lines:
Extracted text (87.4% confidence):
Request work
Suggestions
Plumbing
HVAC
Cleaning
Electrical
Found 6 lines, 6 words.
Use the extracted text directly when implementing UI copy from a design.
Low confidence / garbled text — Tesseract works best with clean, high-contrast text. Screenshots of rendered UI work well. Photos of text at angles or with noise may produce poor results.
Wrong language — Pass the correct language code as the second argument. Tesseract needs the right language model to recognize characters.
First run is slow — Tesseract downloads language data (~4MB for English) on the first run. Subsequent runs are faster.
Weekly Installs
54
Repository
GitHub Stars
17
First Seen
Mar 6, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
kimi-cli54
gemini-cli54
github-copilot54
amp54
cline54
codex54
Skills CLI 使用指南:AI Agent 技能包管理器安装与管理教程
50,200 周安装
Mantine Form 表单库:React 表单验证与状态管理解决方案 | 开源 UI 组件
297 周安装
数据管道架构设计指南:ETL/ELT、Lambda、Kappa、Lakehouse架构与成本优化
287 周安装
SaaS定价策略指南:基于价值的定价模型、范·韦斯滕多普测试与货币化方案
285 周安装
Ionic Framework 设计指南:构建原生外观移动应用的 UI 组件库与主题定制教程
291 周安装
数学计算工具指南:SymPy、Z3、SciPy等符号计算与定理证明工具详解
288 周安装
Sentry Flutter SDK 完整设置指南:错误监控、性能剖析、会话回放
295 周安装