npx skills add https://github.com/inferen-sh/skills --skill explainer-video-guide
Create explainer videos from script to final cut via inference.sh CLI.
Requires the inference.sh CLI (infsh); see its install instructions.
infsh login
# Generate a scene for an explainer
infsh app run google/veo-3-1-fast --input '{
"prompt": "Clean motion graphics style animation, abstract data flowing between connected nodes, blue and white color scheme, professional corporate aesthetic, smooth transitions"
}'
| Section | Duration | Content | Word Count |
|---|---|---|---|
| Problem | 10s | State the pain point the viewer has | ~25 words |
| Agitate | 10s | Show why it's worse than they think | ~25 words |
| Solution | 15s | Introduce your product/idea | ~35 words |
| How It Works | 20s | Show 3 key steps or features | ~50 words |
| CTA | 5s | One clear next action | ~12 words |
| Section | Duration | Content |
|---|---|---|
| Before | 15s | Show the current frustrating state |
| After | 15s | Show the ideal outcome |
| Bridge | 40s | Explain how your product gets them there |
| Social Proof | 10s | Quick stat or testimonial |
| CTA | 10s | Clear next step |
| Section | Duration | Content |
|---|---|---|
| Hook | 3s | Surprising fact or question |
| Feature | 15s | Show one feature solving one problem |
| Result | 7s | The outcome/benefit |
| CTA | 5s | Try it / Learn more |
| Content Type | Words Per Minute | Notes |
|---|---|---|
| Standard narration | 150 wpm | Conversational pace |
| Complex/technical | 120 wpm | Allow processing time |
| Energetic/social | 170 wpm | Faster for short-form |
| Children's content | 100 wpm | Clear and slow |
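The word counts in the script tables follow from the pacing figures above: words = seconds x wpm / 60. A minimal sketch of that arithmetic, assuming a hypothetical helper named `word_budget` (it is not part of infsh):

```shell
# Hypothetical helper: how many words fit in a section at a given pace.
# Usage: word_budget <seconds> <wpm>
word_budget() {
  # Integer division rounds down, so the budget errs on the short side.
  echo $(( $1 * $2 / 60 ))
}

word_budget 10 150   # 10s section at standard narration pace
word_budget 20 120   # 20s section at complex/technical pace
```

Rounding down is deliberate here: a script that runs slightly short is easier to pace than one that forces the narrator to rush.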
Key rule: 1 scene per key message. Don't pack multiple ideas into one visual.
# Product in context
infsh app run google/veo-3-1-fast --input '{
"prompt": "Clean product demonstration video, hands typing on a laptop showing a dashboard interface, bright modern office, soft natural lighting, professional"
}'
# Abstract concept visualization
infsh app run bytedance/seedance-1-5-pro --input '{
"prompt": "Abstract motion graphics, colorful data streams connecting floating geometric shapes, smooth fluid animation, dark background with glowing elements, tech aesthetic"
}'
# Lifestyle/outcome shot
infsh app run google/veo-3-1-fast --input '{
"prompt": "Happy person relaxing on couch with laptop, smiling at screen, bright airy living room, warm afternoon light, satisfied customer feeling, lifestyle commercial style"
}'
# Before/after comparison
infsh app run falai/flux-dev-lora --input '{
"prompt": "Split screen comparison, left side cluttered messy desk with papers and stress, right side clean organized minimalist workspace, dramatic difference, clean design"
}'
# Generate a still frame first
infsh app run falai/flux-dev-lora --input '{
"prompt": "Professional workspace with glowing holographic interface, futuristic but clean, blue accent lighting"
}'
# Animate it
infsh app run falai/wan-2-5-i2v --input '{
"prompt": "Gentle camera push in, holographic elements subtly floating and rotating, soft ambient light shifts",
"image": "path/to/workspace-still.png"
}'
# Professional narration with Dia TTS
infsh app run falai/dia-tts --input '{
"prompt": "[S1] Tired of spending hours on reports that nobody reads? There is a better way. Meet DataFlow. It turns your raw data into visual stories... in seconds. Just connect your source, pick a template, and share. Try DataFlow free today."
}'
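Before sending a script to TTS, it can help to estimate how long it will run so that it fits the planned section lengths. A rough sketch, assuming a hypothetical helper `narration_seconds` and treating speaker tags such as [S1] as non-spoken; the actual length depends on the voice and the pauses it inserts:

```shell
# Hypothetical helper: estimate spoken length of a script in whole seconds.
# Usage: narration_seconds "<script text>" <wpm>
narration_seconds() {
  # Strip speaker tags like [S1] so they are not counted as words,
  # then count whitespace-separated words.
  words=$(printf '%s\n' "$1" | sed 's/\[S[0-9]*]//g' | wc -w | tr -d ' ')
  # seconds = words * 60 / wpm (integer division rounds down)
  echo $(( words * 60 / $2 ))
}

narration_seconds "[S1] Meet DataFlow. It turns your raw data into visual stories... in seconds." 150
```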
| Technique | Effect | Example |
|---|---|---|
| Period . | Medium pause | "This changes everything. Here's how." |
| Ellipsis ... | Long pause (dramatic) | "And the result... was incredible." |
| Comma , | Short pause | "Fast, simple, powerful." |
| Exclamation ! | Emphasis/energy | "Start building today!" |
| Question ? | Rising intonation | "What if there was a better way?" |
Volume: 20-30% under narration (duck 6-12dB when voice plays)
Style: match the brand tone (corporate = ambient electronic, startup = upbeat indie)
Structure: intro swell (first 3s) -> subtle loop under narration -> swell at CTA
No vocals: instrumental only under narration
infsh app run <music-gen-app> --input '{ "prompt": "upbeat corporate background music, modern electronic, 90 BPM, positive and professional, no vocals, suitable for product explainer video" }'
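Many editors and mixing tools take a linear gain multiplier rather than a dB value. The 6-12dB ducking range can be converted with the standard amplitude relation gain = 10^(-dB/20). A small sketch, assuming a hypothetical helper `duck_gain` and that the tool in use expects an amplitude multiplier:

```shell
# Hypothetical helper: convert a dB reduction into a linear amplitude multiplier.
# Usage: duck_gain <dB>
duck_gain() {
  # LC_ALL=C keeps the decimal separator a dot regardless of locale.
  LC_ALL=C awk -v db="$1" 'BEGIN { printf "%.2f\n", 10 ^ (-db / 20) }'
}

duck_gain 6    # prints 0.50
duck_gain 12   # prints 0.25
```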
# 1. Generate voiceover
infsh app run falai/dia-tts --input '{
"prompt": "[S1] Your script here..."
}'
# 2. Generate scene visuals (in parallel)
infsh app run google/veo-3-1-fast --input '{"prompt": "scene 1 description"}' --no-wait
infsh app run google/veo-3-1-fast --input '{"prompt": "scene 2 description"}' --no-wait
infsh app run google/veo-3-1-fast --input '{"prompt": "scene 3 description"}' --no-wait
# 3. Merge scenes into sequence
infsh app run infsh/media-merger --input '{
"media": ["scene1.mp4", "scene2.mp4", "scene3.mp4"]
}'
# 4. Add voiceover to video
infsh app run infsh/video-audio-merger --input '{
"video": "merged-scenes.mp4",
"audio": "voiceover.mp3"
}'
# 5. Add captions
infsh app run infsh/caption-videos --input '{
"video": "final-with-audio.mp4",
"caption_file": "captions.srt"
}'
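Step 5 expects a captions.srt file. One way to produce it by hand is sketched below, assuming cues that start and end on whole seconds. The helpers `srt_time` and `srt_cue` are hypothetical, and the timings are placeholders to be replaced with the real voiceover timing:

```shell
# Hypothetical helper: format whole seconds as an SRT timestamp (HH:MM:SS,mmm).
srt_time() {
  printf '%02d:%02d:%02d,000' $(( $1 / 3600 )) $(( $1 % 3600 / 60 )) $(( $1 % 60 ))
}

# Hypothetical helper: print one SRT cue (index, time range, text, blank line).
# Usage: srt_cue <index> <start_s> <end_s> "<text>"
srt_cue() {
  printf '%s\n%s --> %s\n%s\n\n' "$1" "$(srt_time "$2")" "$(srt_time "$3")" "$4"
}

# Placeholder timings; adjust to match the generated voiceover.
srt_cue 1 0 5 "Tired of spending hours on reports that nobody reads?"
srt_cue 2 5 8 "There is a better way."
```

Redirecting the output of the cue lines to captions.srt gives a file in the standard SubRip layout.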
| Format | Length | Platform |
|---|---|---|
| Social teaser | 15-30s | TikTok, Instagram Reels, YouTube Shorts |
| Product demo | 60-90s | Website, landing page |
| Feature explainer | 90-120s | YouTube, email |
| Tutorial/walkthrough | 2-5min | YouTube, help center |
| Investor pitch video | 2-3min | Pitch deck supplement |
| Transition | When to Use | Effect |
|---|---|---|
| Cut | Default between related scenes | Clean, professional |
| Dissolve/Crossfade | Time passing, mood shift | Soft, contemplative |
| Wipe | New topic or section | Clear separation |
| Zoom/Push | Drilling into detail | Focus attention |
| Match cut | Visual similarity between scenes | Clever, memorable |
| Mistake | Problem | Fix |
|---|---|---|
| Script too wordy | Voiceover rushed, viewer overwhelmed | Cut to 150 wpm max |
| No hook in first 3s | Viewers leave immediately | Start with the problem or surprising stat |
| Visuals lag narration | Confusing disconnect | Visuals should match or slightly precede words |
| Background music too loud | Can't hear narration | Duck music 6-12dB under voice |
| No captions | 85% of social video watched silent | Always add captions |
| Too many ideas | Viewer retains nothing | One core message per video |
npx skills add inference-sh/skills@ai-video-generation
npx skills add inference-sh/skills@video-prompting-guide
npx skills add inference-sh/skills@text-to-speech
npx skills add inference-sh/skills@prompt-engineering
Browse all apps: infsh app list