videoagent-image-studio by pexoai/pexo-skills
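npx skills add https://github.com/pexoai/pexo-skills --skill videoagent-image-studio
Use when: The user asks to generate, draw, create, or make any kind of image, photo, illustration, icon, logo, or artwork.
Generate images with 8 state-of-the-art AI models. This skill automatically picks the best model for the job and handles all the complexity, including Midjourney's async polling, so you can focus on the conversation.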
| User intent | Model | Speed |
|---|---|---|
| Artistic, cinematic, painterly | midjourney | ~15s |
| Photorealistic, portrait, product | flux-pro | ~8s |
| General purpose, balanced | flux-dev | ~10s |
| Quick draft, fast iteration | flux-schnell | ~2s |
| Image with text, logo, poster | ideogram | ~10s |
| Vector art, icon, flat design | recraft | ~8s |
| Anime, stylized illustration | sdxl | ~5s |
| Gemini-powered, consistent style | nano-banana | ~12s |
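The selection table above can be read as a simple lookup. The following is a minimal sketch in Node.js, not the skill's actual selection logic: the function name `pickModel` and the keyword lists are hypothetical and chosen only for illustration, while the model IDs and the `flux-dev` fallback come from the table.

```javascript
// Hypothetical keyword-to-model lookup. The keyword lists are illustrative
// assumptions; only the model IDs are taken from the table above.
const MODEL_RULES = [
  { model: "ideogram", keywords: ["logo", "poster", "text"] },
  { model: "recraft", keywords: ["icon", "vector", "flat"] },
  { model: "sdxl", keywords: ["anime", "stylized"] },
  { model: "flux-schnell", keywords: ["draft", "quick", "fast"] },
  { model: "flux-pro", keywords: ["photo", "portrait", "product"] },
  { model: "nano-banana", keywords: ["consistent"] },
  { model: "midjourney", keywords: ["artistic", "cinematic", "painterly"] },
];

// Returns the first model whose keywords appear in the request,
// falling back to the general-purpose flux-dev.
function pickModel(request) {
  const text = request.toLowerCase();
  for (const rule of MODEL_RULES) {
    if (rule.keywords.some((kw) => text.includes(kw))) return rule.model;
  }
  return "flux-dev";
}

console.log(pickModel("Make me an app icon, flat style")); // recraft
console.log(pickModel("A castle at dusk")); // flux-dev
```

Rule order matters in this sketch: earlier rules win when a request matches several rows, so an explicit model named by the user should always override the lookup.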
Before calling the script, expand the user's prompt with style, lighting, and quality descriptors appropriate for the chosen model.
cinematic lighting, ultra detailed, --v 7, --style raw
masterpiece, highly detailed, sharp focus, professional photography
vector illustration, flat design, icon style
node {baseDir}/tools/generate.js \
--model <model_id> \
--prompt "<enhanced prompt>" \
--aspect-ratio <ratio>
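Prompt expansion can be sketched as appending a model-specific descriptor set to the user's text. The descriptor strings below are the ones listed above; which model each set belongs to is an assumption inferred from the usage examples, and `enhancePrompt` is a hypothetical helper name, not part of the skill.

```javascript
// Descriptor sets come from the text above. The model each one is paired
// with is an assumption, not something the source states explicitly.
const STYLE_SUFFIXES = new Map([
  ["midjourney", "cinematic lighting, ultra detailed, --v 7, --style raw"],
  ["flux-pro", "masterpiece, highly detailed, sharp focus, professional photography"],
  ["recraft", "vector illustration, flat design, icon style"],
]);

// Appends the descriptor set for the model, or returns the trimmed prompt
// unchanged when no set is known for that model.
function enhancePrompt(model, userPrompt) {
  const base = userPrompt.trim();
  const suffix = STYLE_SUFFIXES.get(model);
  return suffix ? `${base}, ${suffix}` : base;
}

console.log(enhancePrompt("recraft", "blue app icon"));
// blue app icon, vector illustration, flat design, icon style
```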
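All parameters:
| Parameter | Default | Description |
|---|---|---|
| --model | flux-dev | Model ID from the table above |
| --prompt | (required) | The image generation prompt |
| --aspect-ratio | 1:1 | 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 21:9 |
| --num-images | 1 | Number of images (1–4; Midjourney always returns 4) |
| --negative-prompt | (none) | Things to avoid (not supported by Midjourney) |
| --seed | (none) | Seed for reproducibility |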
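The parameter table above translates naturally into a small validation step before the script is invoked. This is a sketch under stated assumptions: `buildGenerateArgs` and its option names are hypothetical, and the checks simply mirror the defaults and limits listed in the table rather than anything `generate.js` is known to enforce.

```javascript
// Allowed values copied from the --aspect-ratio row of the table above.
const ASPECT_RATIOS = ["1:1", "16:9", "9:16", "4:3", "3:4", "3:2", "21:9"];

// Hypothetical helper: turns an options object into an argv array for
// generate.js, applying the defaults and limits from the parameter table.
function buildGenerateArgs({
  model = "flux-dev",
  prompt,
  aspectRatio = "1:1",
  numImages = 1,
  negativePrompt,
  seed,
} = {}) {
  if (!prompt) throw new Error("--prompt is required");
  if (!ASPECT_RATIOS.includes(aspectRatio)) {
    throw new Error(`unsupported aspect ratio: ${aspectRatio}`);
  }
  if (!Number.isInteger(numImages) || numImages < 1 || numImages > 4) {
    throw new Error("--num-images must be an integer between 1 and 4");
  }
  const args = [
    "--model", model,
    "--prompt", prompt,
    "--aspect-ratio", aspectRatio,
    "--num-images", String(numImages),
  ];
  // Midjourney does not support negative prompts, so the flag is dropped for it.
  if (negativePrompt && model !== "midjourney") {
    args.push("--negative-prompt", negativePrompt);
  }
  if (seed !== undefined) args.push("--seed", String(seed));
  return args;
}

console.log(buildGenerateArgs({ prompt: "a cat", aspectRatio: "16:9" }).join(" "));
```

Returning an array rather than a single string means the result can be passed to `child_process.execFile("node", [scriptPath, ...args])`, which avoids shell quoting problems in prompts.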
The script always waits and returns the final image URL(s). No polling is required.
{
  "success": true,
  "model": "flux-pro",
  "imageUrl": "https://...",
  "images": ["https://..."]
}
Send the imageUrl to the user.
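Handling the response shown above might look like the following sketch. It assumes the script prints a single JSON object on stdout and that a failed run is signalled by `success` not being `true`; the source does not document the failure shape, so that part and the helper name `extractImageUrls` are assumptions.

```javascript
// Parses the script's stdout and returns the URL to send to the user plus
// the full image list. The failure handling is an assumption: the source
// only documents the successful response shape.
function extractImageUrls(stdout) {
  let result;
  try {
    result = JSON.parse(stdout);
  } catch {
    throw new Error("generate.js did not return valid JSON");
  }
  if (!result || result.success !== true) {
    throw new Error("image generation failed");
  }
  const images = Array.isArray(result.images) ? result.images : [];
  const primary = result.imageUrl ?? images[0];
  if (!primary) throw new Error("response contained no image URL");
  return { primary, images: images.length > 0 ? images : [primary] };
}

const sample = JSON.stringify({
  success: true,
  model: "flux-pro",
  imageUrl: "https://example.com/a.png",
  images: ["https://example.com/a.png"],
});
console.log(extractImageUrls(sample).primary); // https://example.com/a.png
```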
After generating a 4-image grid with Midjourney, offer the user these options:
# Upscale image #2 (subtle, preserves details)
node {baseDir}/tools/generate.js \
--model midjourney \
--action upscale \
--index 2 \
--job-id <job_id>
# Create a strong variation of image #3
node {baseDir}/tools/generate.js \
--model midjourney \
--action variation \
--index 3 \
--job-id <job_id> \
--variation-type 1
# Regenerate with the same prompt
node {baseDir}/tools/generate.js \
--model midjourney \
--action reroll \
--job-id <job_id>
Upscale types: 0 = Subtle (default, best for photos), 1 = Creative (best for illustrations)
Variation types: 0 = Subtle (default), 1 = Strong (dramatic changes)
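The three follow-up commands above share most of their flags, so they can be assembled by one helper. This is a sketch only: `buildMidjourneyAction` is a hypothetical name, and the 1 to 4 index range is an assumption based on the 4-image grid. The source shows a `--variation-type` flag but no flag for the upscale type, so the sketch does not invent one.

```javascript
// Numeric codes from the "Variation types" line above.
const VARIATION_TYPES = { subtle: 0, strong: 1 };

// Hypothetical helper: builds the argv array for a Midjourney follow-up
// action (upscale, variation, or reroll) on an existing job.
function buildMidjourneyAction({ action, jobId, index, variationType }) {
  if (!jobId) throw new Error("--job-id is required");
  const args = ["--model", "midjourney", "--action", action, "--job-id", jobId];
  if (action === "reroll") return args; // reroll takes no image index
  if (action !== "upscale" && action !== "variation") {
    throw new Error(`unknown action: ${action}`);
  }
  // Assumption: indexes run 1-4 because the grid contains four images.
  if (!Number.isInteger(index) || index < 1 || index > 4) {
    throw new Error("--index must be an integer between 1 and 4");
  }
  args.push("--index", String(index));
  if (action === "variation" && variationType !== undefined) {
    if (!(variationType in VARIATION_TYPES)) {
      throw new Error(`unknown variation type: ${variationType}`);
    }
    args.push("--variation-type", String(VARIATION_TYPES[variationType]));
  }
  return args;
}

console.log(
  buildMidjourneyAction({ action: "variation", jobId: "<job_id>", index: 3, variationType: "strong" }).join(" ")
);
```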
User: "Draw a snow leopard on a snowy mountain with cinematic lighting"
# Choose midjourney for artistic quality
node {baseDir}/tools/generate.js \
--model midjourney \
--prompt "a majestic snow leopard on a snowy mountain peak, cinematic lighting, dramatic atmosphere, ultra detailed --ar 16:9 --v 7" \
--aspect-ratio 16:9
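🎨 Done! Which one should I upscale (U1-U4), or would you like a variation (V1-V4)?
User: "Use Flux to generate a perfume product poster, white background"
# Choose flux-pro for photorealistic product shots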
node {baseDir}/tools/generate.js \
--model flux-pro \
--prompt "a luxury perfume bottle on a clean white background, professional product photography, soft shadows, 8k, highly detailed" \
--aspect-ratio 3:4
User: "Show me a quick draft"
# Use flux-schnell for instant previews
node {baseDir}/tools/generate.js \
--model flux-schnell \
--prompt "..." \
--aspect-ratio 1:1
User: "Make me an app icon, flat style, blue theme"
# Use recraft for vector/icon style
node {baseDir}/tools/generate.js \
--model recraft \
--prompt "a minimal flat design app icon, blue color scheme, simple geometric shapes, vector style, white background"
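Zero API keys needed! All requests go through a hosted proxy that handles authentication server-side.
The skill works out of the box: just install and use.
If you want to use your own proxy or a persistent token, set these environment variables: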
{
  "skills": {
    "entries": {
      "videoagent-image-studio": {
        "enabled": true,
        "env": {
          "IMAGE_STUDIO_PROXY_URL": "https://your-proxy.vercel.app",
          "IMAGE_STUDIO_TOKEN": "your_token_here"
        }
      }
    }
  }
}
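| Variable | Required | Description |
|---|---|---|
| IMAGE_STUDIO_PROXY_URL | No | Custom proxy base URL (default: https://image-gen-proxy.vercel.app) |
| IMAGE_STUDIO_TOKEN | No | Persistent token (auto-obtained if not set; 100 free uses per token) |
To deploy your own proxy, see the videoagent-audio-studio proxy as a reference implementation. You'll need to set FAL_KEY and LEGNEXT_KEY as Vercel environment variables.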
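The two optional variables above could be resolved as in the sketch below. The variable names and the default URL come from the table; the helper name `resolveProxyConfig`, the trailing-slash trimming, and the use of `null` to mean "obtain a token automatically" are assumptions about how a client might consume them, not a description of what `generate.js` does.

```javascript
// Default taken from the IMAGE_STUDIO_PROXY_URL row of the table above.
const DEFAULT_PROXY_URL = "https://image-gen-proxy.vercel.app";

// Hypothetical helper: reads the optional settings from an env-like object.
function resolveProxyConfig(env = process.env) {
  // Strip trailing slashes so request paths can be appended safely.
  const proxyUrl = (env.IMAGE_STUDIO_PROXY_URL || DEFAULT_PROXY_URL).replace(/\/+$/, "");
  // null signals that no persistent token was configured.
  const token = env.IMAGE_STUDIO_TOKEN || null;
  return { proxyUrl, token };
}

console.log(resolveProxyConfig({}));
// { proxyUrl: 'https://image-gen-proxy.vercel.app', token: null }
```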
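No --async / --poll flags are needed in SKILL.md instructions.
Responses use the { success, imageUrl, images } shape.
Pass --reference-images "url1,url2" for character/style consistency across generations.
Weekly installs: 5.0K
GitHub stars: 358
First seen: March 6, 2026
Security audits: Gen Agent Trust Hub (Pass), Socket (Pass), Snyk (Pass)
Installed on: openclaw 3.6K, claude-code 3.5K, gemini-cli 1.6K, cursor 1.6K, kimi-cli 1.6K, codex 1.6K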