text-to-speech by inferen-sh/skills
npx skills add https://github.com/inferen-sh/skills --skill text-to-speech通过 inference.sh CLI 将文本转换为自然语音。

需要 inference.sh CLI (
infsh)。安装说明
infsh login
# 生成语音
infsh app run infsh/kokoro-tts --input '{"text": "Hello, welcome to our product demo."}'
| 模型 | 应用 ID | 最佳用途 |
|---|---|---|
| ElevenLabs TTS | elevenlabs/tts | 高品质,22+ 种声音,32 种语言 |
| DIA TTS |
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
infsh/dia-tts |
| 对话式,富有表现力 |
| Kokoro TTS | infsh/kokoro-tts | 快速,自然 |
| Chatterbox | infsh/chatterbox | 通用 |
| Higgs Audio | infsh/higgs-audio | 情感控制 |
| VibeVoice | infsh/vibevoice | 播客,长篇内容 |
infsh app list --category audio
infsh app run infsh/kokoro-tts --input '{"text": "Welcome to our tutorial."}'
infsh app sample infsh/dia-tts --save input.json
# 编辑 input.json:
# {
# "text": "嘿!你今天怎么样?我真的很兴奋能和你分享这个。",
# "voice": "conversational"
# }
infsh app run infsh/dia-tts --input input.json
infsh app sample infsh/vibevoice --save input.json
# 使用你的播客脚本编辑 input.json
infsh app run infsh/vibevoice --input input.json
infsh app sample infsh/higgs-audio --save input.json
# {
# "text": "这简直太不可思议了!",
# "emotion": "excited"
# }
infsh app run infsh/higgs-audio --input input.json
生成语音,然后创建会说话的头部视频:
# 1. 生成语音
infsh app run infsh/kokoro-tts --input '{"text": "你的脚本在这里"}' > speech.json
# 2. 将音频 URL 与 OmniHuman 结合使用,生成虚拟形象视频
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "<audio-url-from-step-1>"
}'
# ElevenLabs TTS (高级,22+ 种声音)
npx skills add inference-sh/skills@elevenlabs-tts
# ElevenLabs 对话 (多说话人)
npx skills add inference-sh/skills@elevenlabs-dialogue
# 完整平台技能 (所有 150+ 个应用)
npx skills add inference-sh/skills@infsh-cli
# AI 虚拟形象 (将 TTS 与会说话的头部结合)
npx skills add inference-sh/skills@ai-avatar-video
# AI 音乐生成
npx skills add inference-sh/skills@ai-music-generation
# 语音转文本 (转录)
npx skills add inference-sh/skills@speech-to-text
# 视频生成
npx skills add inference-sh/skills@ai-video-generation
浏览所有应用:infsh app list
每周安装量
7.2K
仓库
GitHub 星标数
184
首次出现
13 天前
安全审计
安装于
claude-code5.7K
gemini-cli5.1K
codex5.1K
amp5.1K
opencode5.1K
kimi-cli5.1K
Convert text to natural speech via inference.sh CLI.

Requires inference.sh CLI (
infsh). Install instructions
infsh login
# Generate speech
infsh app run infsh/kokoro-tts --input '{"text": "Hello, welcome to our product demo."}'
| Model | App ID | Best For |
|---|---|---|
| ElevenLabs TTS | elevenlabs/tts | Premium quality, 22+ voices, 32 languages |
| DIA TTS | infsh/dia-tts | Conversational, expressive |
| Kokoro TTS | infsh/kokoro-tts | Fast, natural |
| Chatterbox | infsh/chatterbox | General purpose |
| Higgs Audio | infsh/higgs-audio | Emotional control |
| VibeVoice | infsh/vibevoice | Podcasts, long-form |
infsh app list --category audio
infsh app run infsh/kokoro-tts --input '{"text": "Welcome to our tutorial."}'
infsh app sample infsh/dia-tts --save input.json
# Edit input.json:
# {
# "text": "Hey! How are you doing today? I'm really excited to share this with you.",
# "voice": "conversational"
# }
infsh app run infsh/dia-tts --input input.json
infsh app sample infsh/vibevoice --save input.json
# Edit input.json with your podcast script
infsh app run infsh/vibevoice --input input.json
infsh app sample infsh/higgs-audio --save input.json
# {
# "text": "This is absolutely incredible!",
# "emotion": "excited"
# }
infsh app run infsh/higgs-audio --input input.json
Generate speech, then create a talking head video:
# 1. Generate speech
infsh app run infsh/kokoro-tts --input '{"text": "Your script here"}' > speech.json
# 2. Use the audio URL with OmniHuman for avatar video
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "<audio-url-from-step-1>"
}'
# ElevenLabs TTS (premium, 22+ voices)
npx skills add inference-sh/skills@elevenlabs-tts
# ElevenLabs dialogue (multi-speaker)
npx skills add inference-sh/skills@elevenlabs-dialogue
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli
# AI avatars (combine TTS with talking heads)
npx skills add inference-sh/skills@ai-avatar-video
# AI music generation
npx skills add inference-sh/skills@ai-music-generation
# Speech-to-text (transcription)
npx skills add inference-sh/skills@speech-to-text
# Video generation
npx skills add inference-sh/skills@ai-video-generation
Browse all apps: infsh app list
Weekly Installs
7.2K
Repository
GitHub Stars
184
First Seen
13 days ago
Security Audits
Gen Agent Trust HubPassSocketWarnSnykWarn
Installed on
claude-code5.7K
gemini-cli5.1K
codex5.1K
amp5.1K
opencode5.1K
kimi-cli5.1K
99,500 周安装