ai-avatar-video by inferen-sh/skills
npx skills add https://github.com/inferen-sh/skills --skill ai-avatar-video

Create AI avatars and talking head videos via inference.sh CLI.

Requires the inference.sh CLI (infsh). See the install instructions.
infsh login
# Create avatar video from image + audio
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
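When the image and audio URLs come from shell variables, the inline JSON can be assembled by a small helper rather than typed by hand. The following is a minimal sketch: `build_avatar_input` is a hypothetical helper name (not part of infsh), and it assumes neither URL contains double quotes or backslashes.

```shell
#!/bin/sh
# Hypothetical helper: build the JSON payload for an avatar app
# from an image URL ($1) and an audio URL ($2).
# Assumption: neither URL contains double quotes or backslashes.
build_avatar_input() {
  printf '{"image_url": "%s", "audio_url": "%s"}' "$1" "$2"
}

# Usage (requires infsh to be installed and logged in):
# infsh app run bytedance/omnihuman-1-5 \
#   --input "$(build_avatar_input "https://portrait.jpg" "https://speech.mp3")"
```

The same payload shape is used by the other image + audio apps shown in this document, so the helper could be reused with a different app ID.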
| Model | App ID | Best For |
|---|---|---|
| OmniHuman 1.5 | bytedance/omnihuman-1-5 | Multi-character, best quality |
| OmniHuman 1.0 | bytedance/omnihuman-1-0 | Single character |
| Fabric 1.0 | falai/fabric-1-0 | Image talks with lipsync |
| PixVerse Lipsync | falai/pixverse-lipsync | Highly realistic |
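For scripts that pick a model by use case, the table above could be encoded as a lookup. This is only a sketch: the function name `avatar_app_for` and the keywords (`multi`, `single`, `image`, `realistic`) are illustrative assumptions; only the app IDs come from the table.

```shell
#!/bin/sh
# Hypothetical helper: map a short use-case keyword to an app ID.
# The keywords are made up for illustration; the app IDs are from the table.
avatar_app_for() {
  case "$1" in
    multi)     echo "bytedance/omnihuman-1-5" ;;
    single)    echo "bytedance/omnihuman-1-0" ;;
    image)     echo "falai/fabric-1-0" ;;
    realistic) echo "falai/pixverse-lipsync" ;;
    *)         echo "unknown use case: $1" >&2; return 1 ;;
  esac
}

# Usage:
# infsh app run "$(avatar_app_for multi)" --input '{ ... }'
```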
infsh app list --search "omnihuman"
infsh app list --search "lipsync"
infsh app list --search "fabric"
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Supports specifying which character to drive in multi-person images.
infsh app run falai/fabric-1-0 --input '{
"image_url": "https://face.jpg",
"audio_url": "https://audio.mp3"
}'
infsh app run falai/pixverse-lipsync --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Generates highly realistic lipsync from any audio.
# 1. Generate speech from text
infsh app run infsh/kokoro-tts --input '{
"prompt": "Welcome to our product demo. Today I will show you..."
}' > speech.json
# 2. Create avatar video with the speech
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://presenter-photo.jpg",
"audio_url": "<audio-url-from-step-1>"
}'
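Step 2 needs the audio URL produced by step 1. The output schema of the TTS app is not shown here, so the sketch below avoids guessing a field name and simply takes the first http(s) URL found in the saved JSON. `first_url_in` is a hypothetical helper, and it assumes the audio URL appears as a plain, unescaped string and is the first URL in the file.

```shell
#!/bin/sh
# Hypothetical helper: print the first http(s) URL found in a file.
# Assumption: the URL is stored as a plain string (no "\/" escaping)
# and the wanted URL is the first one in the file.
first_url_in() {
  grep -oE 'https?://[^"[:space:]]+' "$1" | head -n 1
}

# Usage, after step 1 has written speech.json:
# AUDIO_URL=$(first_url_in speech.json)
# infsh app run bytedance/omnihuman-1-5 \
#   --input "{\"image_url\": \"https://presenter-photo.jpg\", \"audio_url\": \"$AUDIO_URL\"}"
```

If the actual output field name is known and jq is installed, a targeted jq query would be more precise than this pattern match.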
# 1. Transcribe original video
infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "https://video.mp4"}' > transcript.json
# 2. Translate text (manually or with an LLM)
# 3. Generate speech in new language
infsh app run infsh/kokoro-tts --input '{"text": "<translated-text>"}' > new_speech.json
# 4. Lipsync the original video with new audio
infsh app run infsh/latentsync-1-6 --input '{
"video_url": "https://original-video.mp4",
"audio_url": "<new-audio-url>"
}'
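In step 3, translated text that contains double quotes or backslashes would break the inline JSON. A minimal sketch of escaping it first is shown below; `json_escape` is a hypothetical helper, and it assumes the text is a single line with no control characters.

```shell
#!/bin/sh
# Hypothetical helper: escape backslashes and double quotes so a
# single-line string can be embedded in a JSON string literal.
# Assumption: no newlines or other control characters in the input.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Usage, with the translated text in $TRANSLATED:
# infsh app run infsh/kokoro-tts \
#   --input "{\"text\": \"$(json_escape "$TRANSLATED")\"}" > new_speech.json
```

Backslashes are escaped before quotes so that the backslashes introduced for the quotes are not doubled.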
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli
# Text-to-speech (generate audio for avatars)
npx skills add inference-sh/skills@text-to-speech
# Speech-to-text (transcribe for dubbing)
npx skills add inference-sh/skills@speech-to-text
# Video generation
npx skills add inference-sh/skills@ai-video-generation
# Image generation (create avatar images)
npx skills add inference-sh/skills@ai-image-generation
Browse all video apps: infsh app list --category video
Weekly installs: 7.3K
GitHub stars: 228
First seen: Mar 12, 2026