minimax-image-understanding by imsus/pi-extension-minimax-coding-plan-mcp
npx skills add https://github.com/imsus/pi-extension-minimax-coding-plan-mcp --skill minimax-image-understanding当你需要分析、描述或从图像中提取信息时,请使用此技能。
直接调用 understand_image 工具,并提供一个提示词和图像 URL:
understand_image({
prompt: "Your question about the image",
image_url: "https://example.com/image.png"
})
在以下场景中使用 understand_image:
在以下场景中不要使用 understand_image:
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
understand_image({
prompt: "What do you see in this image?",
image_url: "https://example.com/screenshot.png"
})
端点 : POST {api_host}/v1/coding_plan/vlm
请求体 :
{
"prompt": "Your question about the image",
"image_url": "data:image/jpeg;base64,/9j/4AAQ..."
}
响应格式 :
{
"content": "AI analysis of the image...",
"base_resp": {
"status_code": 0,
"status_msg": "success"
}
}
该工具自动处理三种类型的图像输入:
HTTP/HTTPS URL:下载图像并转换为 base64
https://example.com/image.jpg本地文件路径:读取本地文件并转换为 base64
/Users/username/Documents/image.pngimages/photo.png@ 前缀,会将其移除Base64 数据 URL:直接传递现有的 base64 数据
data:image/png;base64,iVBORw0KGgo...支持:
不支持:
understand_image({
prompt: "What is the error message and where is it located in this screenshot?",
image_url: "./error-screenshot.png"
})
understand_image({
prompt: "What code is shown in this screenshot? Please transcribe it exactly.",
image_url: "https://example.com/code.png"
})
understand_image({
prompt: "Analyze this UI design. What is working well and what could be improved?",
image_url: "https://example.com/mockup.png"
})
understand_image({
prompt: "Extract all text from this image",
image_url: "/Users/username/Documents/scan.png"
})
每周安装量
256
代码仓库
GitHub 星标数
2
首次出现
2026年2月18日
安全审计
安装于
opencode254
gemini-cli252
github-copilot252
codex252
cursor251
kimi-cli250
Use this skill when you need to analyze, describe, or extract information from images.
Call the understand_image tool directly with a prompt and image URL:
understand_image({
prompt: "Your question about the image",
image_url: "https://example.com/image.png"
})
Use understand_image when:
Do NOT use understand_image when:
understand_image({
prompt: "What do you see in this image?",
image_url: "https://example.com/screenshot.png"
})
Endpoint : POST {api_host}/v1/coding_plan/vlm
Request Body :
{
"prompt": "Your question about the image",
"image_url": "data:image/jpeg;base64,/9j/4AAQ..."
}
Response Format :
{
"content": "AI analysis of the image...",
"base_resp": {
"status_code": 0,
"status_msg": "success"
}
}
The tool automatically handles three types of image inputs:
HTTP/HTTPS URLs : Downloads the image and converts to base64
https://example.com/image.jpgLocal file paths : Reads local files and converts to base64
/Users/username/Documents/image.pngimages/photo.png@ prefix if presentBase64 data URLs : Passes through existing base64 data
data:image/png;base64,iVBORw0KGgo...Supported:
Not supported:
understand_image({
prompt: "What is the error message and where is it located in this screenshot?",
image_url: "./error-screenshot.png"
})
understand_image({
prompt: "What code is shown in this screenshot? Please transcribe it exactly.",
image_url: "https://example.com/code.png"
})
understand_image({
prompt: "Analyze this UI design. What is working well and what could be improved?",
image_url: "https://example.com/mockup.png"
})
understand_image({
prompt: "Extract all text from this image",
image_url: "/Users/username/Documents/scan.png"
})
Weekly Installs
256
Repository
GitHub Stars
2
First Seen
Feb 18, 2026
Security Audits
Gen Agent Trust HubFailSocketPassSnykFail
Installed on
opencode254
gemini-cli252
github-copilot252
codex252
cursor251
kimi-cli250
React 组合模式指南:Vercel 组件架构最佳实践,提升代码可维护性
106,200 周安装
Nx Import 使用指南:从源仓库导入代码并保留Git历史
250 周安装
OpenPencil CLI 工具:.fig 设计文件命令行操作与 MCP 服务器 | 设计自动化
250 周安装
学术深度研究技能:AI驱动的学术文献综述与多源验证工具,生成APA格式报告
250 周安装
React PDF 渲染器 - 使用 JSON 生成 PDF 文档,支持自定义组件和流式渲染
250 周安装
后端安全编码专家 | 安全开发实践、漏洞预防与防御性编程技术指南
250 周安装
TanStack Form:高性能无头表单库,支持TypeScript、Zod、Valibot验证
250 周安装