VideoAgent AI图像生成器 | 8大模型一键生成图片、插画、Logo、海报

videoagent-image-studio by pexoai/pexo-skills

5,000 周安装量

388 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/pexoai/pexo-skills --skill videoagent-image-studio

AI/机器学习开发设计

🇨🇳中文介绍

🎨 VideoAgent 图像工作室

使用场景： 当用户要求生成、绘制、创建或制作任何类型的图像、照片、插图、图标、标志或艺术作品时使用。

使用 8 种先进的 AI 模型生成图像。此技能会自动为任务选择最佳模型并处理所有复杂性——包括 Midjourney 的异步轮询——因此您可以专注于对话。

快速参考

用户意图	模型	速度
艺术性、电影感、绘画风格	`midjourney`	~15秒
照片级真实感、肖像、产品	`flux-pro`	~8秒
通用、均衡	`flux-dev`	~10秒
快速草稿、快速迭代	`flux-schnell`

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

步骤 1 — 优化提示词

在调用脚本之前，根据所选模型，使用适当的风格、光照和质量描述符来扩展用户的提示词。

Midjourney : 添加 cinematic lighting, ultra detailed, --v 7, --style raw
Flux : 添加 masterpiece, highly detailed, sharp focus, professional photography
Ideogram : 明确说明文字内容、字体样式和布局
Recraft : 指定 vector illustration, flat design, icon style

步骤 2 — 运行脚本

node {baseDir}/tools/generate.js \
  --model <model_id> \
  --prompt "<enhanced prompt>" \
  --aspect-ratio <ratio>

参数	默认值	描述
`--model`	`flux-dev`	上表中的模型 ID
`--prompt`	(必填)	图像生成提示词
`--aspect-ratio`	`1:1`	`1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `3:2`, `21:9`
`--num-images`	`1`	图像数量 (1–4；Midjourney 始终返回 4 张)
`--negative-prompt`	—	需要避免的内容 (Midjourney 不支持)
`--seed`	—	用于可重现性的种子值

步骤 3 — 返回结果

脚本始终会等待并返回最终的图像 URL。无需轮询。

{
  "success": true,
  "model": "flux-pro",
  "imageUrl": "https://...",
  "images": ["https://..."]
}

将 imageUrl 发送给用户。

使用 Midjourney 生成 4 张图像网格后，为用户提供以下选项：

# 放大图像 #2 (精细，保留细节)
node {baseDir}/tools/generate.js \
  --model midjourney \
  --action upscale \
  --index 2 \
  --job-id <job_id>

# 创建图像 #3 的强烈变体
node {baseDir}/tools/generate.js \
  --model midjourney \
  --action variation \
  --index 3 \
  --job-id <job_id> \
  --variation-type 1

# 使用相同提示词重新生成
node {baseDir}/tools/generate.js \
  --model midjourney \
  --action reroll \
  --job-id <job_id>

放大类型： 0 = 精细 (默认，最适合照片), 1 = 创意 (最适合插图)

变体类型： 0 = 精细 (默认), 1 = 强烈 (显著变化)

用户： "绘制一只雪豹在雪山上，要有电影感的光照"

# 选择 midjourney 以获得艺术质量
node {baseDir}/tools/generate.js \
  --model midjourney \
  --prompt "a majestic snow leopard on a snowy mountain peak, cinematic lighting, dramatic atmosphere, ultra detailed --ar 16:9 --v 7" \
  --aspect-ratio 16:9

🎨 完成！要放大哪一张？(U1-U4) 还是创建变体？(V1-V4)

用户： "使用 Flux 生成一个香水产品海报，白色背景"

# 选择 flux-pro 以获得照片级真实的产品拍摄效果
node {baseDir}/tools/generate.js \
  --model flux-pro \
  --prompt "a luxury perfume bottle on a clean white background, professional product photography, soft shadows, 8k, highly detailed" \
  --aspect-ratio 3:4

用户： "给我看一个快速草稿"

# 使用 flux-schnell 进行即时预览
node {baseDir}/tools/generate.js \
  --model flux-schnell \
  --prompt "..." \
  --aspect-ratio 1:1

用户： "给我制作一个应用图标，扁平风格，蓝色主题"

# 使用 recraft 获得矢量/图标风格
node {baseDir}/tools/generate.js \
  --model recraft \
  --prompt "a minimal flat design app icon, blue color scheme, simple geometric shapes, vector style, white background"

无需 API 密钥！ 所有请求都通过一个托管代理服务器进行，该服务器在服务端处理身份验证。

此技能开箱即用——只需安装并使用即可。

高级：自定义代理或令牌

如果您想使用自己的代理服务器或持久令牌，请设置以下环境变量：

{
  "skills": {
    "entries": {
      "videoagent-image-studio": {
        "enabled": true,
        "env": {
          "IMAGE_STUDIO_PROXY_URL": "https://your-proxy.vercel.app",
          "IMAGE_STUDIO_TOKEN": "your_token_here"
        }
      }
    }
  }
}

变量	是否必需	描述
`IMAGE_STUDIO_PROXY_URL`	否	自定义代理基础 URL (默认: `https://image-gen-proxy.vercel.app`)
`IMAGE_STUDIO_TOKEN`	否	持久令牌 (如果未设置则自动获取，每个令牌免费使用 100 次)

要部署您自己的代理服务器，请参考 videoagent-audio-studio proxy 作为参考实现。您需要将 FAL_KEY 和 LEGNEXT_KEY 作为 Vercel 环境变量。

简化异步处理 : 脚本现在会阻塞直到 Midjourney 完成。SKILL.md 说明中不再需要 --async / --poll 标志。
统一输出格式 : 所有模型返回相同的 { success, imageUrl, images } 结构。
Nano Banana 的参考图像 : 传递 --reference-images "url1,url2" 以在多次生成间保持角色/风格一致性。

为 Midjourney 添加了非阻塞异步模式 (--async + --poll)。

默认启用 Midjourney 极速模式 (~10-20秒)。

将 Midjourney 提供商从 TTAPI 切换为 Legnext.ai，以获得更好的稳定性。

初始版本，包含 Midjourney、Flux、SDXL、Nano Banana、Ideogram、Recraft。

🇺🇸English

🎨 VideoAgent Image Studio

Use when: User asks to generate, draw, create, or make any kind of image, photo, illustration, icon, logo, or artwork.

Generate images with 8 state-of-the-art AI models. This skill automatically picks the best model for the job and handles all the complexity — including Midjourney's async polling — so you can focus on the conversation.

Quick Reference

User Intent	Model	Speed
Artistic, cinematic, painterly	`midjourney`	~15s
Photorealistic, portrait, product	`flux-pro`	~8s
General purpose, balanced	`flux-dev`	~10s
Quick draft, fast iteration	`flux-schnell`	~2s
Image with text, logo, poster	`ideogram`	~10s
Vector art, icon, flat design	`recraft`	~8s
Anime, stylized illustration	`sdxl`	~5s
Gemini-powered, consistent style	`nano-banana`	~12s

How to Generate an Image

Step 1 — Enhance the prompt

Before calling the script, expand the user's prompt with style, lighting, and quality descriptors appropriate for the chosen model.

Midjourney : Add cinematic lighting, ultra detailed, --v 7, --style raw
Flux : Add masterpiece, highly detailed, sharp focus, professional photography
Ideogram : Be explicit about text content, font style, and layout
Recraft : Specify vector illustration, flat design,

Step 2 — Run the script

node {baseDir}/tools/generate.js \
  --model <model_id> \
  --prompt "<enhanced prompt>" \
  --aspect-ratio <ratio>

All parameters:

Parameter	Default	Description
`--model`	`flux-dev`	Model ID from the table above
`--prompt`	(required)	The image generation prompt
`--aspect-ratio`	`1:1`	`1:1`, `16:9`, , , , ,

Step 3 — Return the result

The script always waits and returns the final image URL(s). No polling required.

{
  "success": true,
  "model": "flux-pro",
  "imageUrl": "https://...",
  "images": ["https://..."]
}

Send the imageUrl to the user.

Midjourney Actions

After generating a 4-image grid with Midjourney, offer the user these options:

# Upscale image #2 (subtle, preserves details)
node {baseDir}/tools/generate.js \
  --model midjourney \
  --action upscale \
  --index 2 \
  --job-id <job_id>

# Create a strong variation of image #3
node {baseDir}/tools/generate.js \
  --model midjourney \
  --action variation \
  --index 3 \
  --job-id <job_id> \
  --variation-type 1

# Regenerate with same prompt
node {baseDir}/tools/generate.js \
  --model midjourney \
  --action reroll \
  --job-id <job_id>

Upscale types: 0 = Subtle (default, best for photos), 1 = Creative (best for illustrations)

Variation types: 0 = Subtle (default), 1 = Strong (dramatic changes)

Example Conversations

User: "Draw a snow leopard on a snowy mountain with cinematic lighting"

# Choose midjourney for artistic quality
node {baseDir}/tools/generate.js \
  --model midjourney \
  --prompt "a majestic snow leopard on a snowy mountain peak, cinematic lighting, dramatic atmosphere, ultra detailed --ar 16:9 --v 7" \
  --aspect-ratio 16:9

🎨 Done! Which one to upscale? (U1-U4) Or create a variant? (V1-V4)

User: "Use Flux to generate a perfume product poster, white background"

# Choose flux-pro for photorealistic product shots
node {baseDir}/tools/generate.js \
  --model flux-pro \
  --prompt "a luxury perfume bottle on a clean white background, professional product photography, soft shadows, 8k, highly detailed" \
  --aspect-ratio 3:4

User: "Show me a quick draft"

# flux-schnell for instant previews
node {baseDir}/tools/generate.js \
  --model flux-schnell \
  --prompt "..." \
  --aspect-ratio 1:1

User: "Make me an App icon, flat style, blue theme"

# recraft for vector/icon style
node {baseDir}/tools/generate.js \
  --model recraft \
  --prompt "a minimal flat design app icon, blue color scheme, simple geometric shapes, vector style, white background"

Setup

Zero API keys needed! All requests go through a hosted proxy that handles authentication server-side.

The skill works out of the box — just install and use.

Advanced: Custom proxy or token

If you want to use your own proxy or a persistent token, set these environment variables:

{
  "skills": {
    "entries": {
      "videoagent-image-studio": {
        "enabled": true,
        "env": {
          "IMAGE_STUDIO_PROXY_URL": "https://your-proxy.vercel.app",
          "IMAGE_STUDIO_TOKEN": "your_token_here"
        }
      }
    }
  }
}

Variable	Required	Description
`IMAGE_STUDIO_PROXY_URL`	No	Custom proxy base URL (default: `https://image-gen-proxy.vercel.app`)
`IMAGE_STUDIO_TOKEN`	No	Persistent token (auto-obtained if not set, 100 free uses per token)

To deploy your own proxy, see the videoagent-audio-studio proxy as a reference implementation. You'll need FAL_KEY and LEGNEXT_KEY as Vercel environment variables.

Changelog

v2.0.0

Simplified async : The script now blocks until Midjourney completes. No more --async / --poll flags needed in SKILL.md instructions.
Unified output format : All models return the same { success, imageUrl, images } shape.
Reference images for Nano Banana : Pass --reference-images "url1,url2" for character/style consistency across generations.

v1.3.0

Added non-blocking async mode for Midjourney (--async + --poll).

v1.2.0

Midjourney turbo mode enabled by default (~10-20s).

v1.1.0

Switched Midjourney provider from TTAPI to Legnext.ai for better stability.

v1.0.0

Initial release with Midjourney, Flux, SDXL, Nano Banana, Ideogram, Recraft.

Weekly Installs

5.0K

Repository

pexoai/pexo-skills

GitHub Stars

358

First Seen

Mar 6, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykPass

Installed on

openclaw3.6K

claude-code3.5K

gemini-cli1.6K

cursor1.6K

kimi-cli1.6K

codex1.6K

React 组合模式指南：Vercel 组件架构最佳实践，提升代码可维护性

102,200 周安装

VideoAgent AI图像生成器 | 8大模型一键生成图片、插画、Logo、海报

🇨🇳中文介绍

🎨 VideoAgent 图像工作室

快速参考

相关 Skills

如何生成图像

步骤 1 — 优化提示词

步骤 2 — 运行脚本

步骤 3 — 返回结果

Midjourney 操作

对话示例

设置

高级：自定义代理或令牌

更新日志

v2.0.0

v1.3.0

v1.2.0

v1.1.0

v1.0.0

🇺🇸English

🎨 VideoAgent Image Studio

Quick Reference

How to Generate an Image

Step 1 — Enhance the prompt

Step 2 — Run the script

Step 3 — Return the result

Midjourney Actions

Example Conversations

Setup

Advanced: Custom proxy or token

Changelog

v2.0.0

v1.3.0

v1.2.0

v1.1.0

v1.0.0

最新 Skills