ai-voice-cloning by inferen-sh/skills
npx skills add https://github.com/inferen-sh/skills --skill ai-voice-cloning通过 inference.sh CLI 生成自然的 AI 语音。

需要 inference.sh CLI (
infsh)。安装说明
infsh login
# 生成语音
infsh app run infsh/kokoro-tts --input '{
"prompt": "Hello! This is an AI-generated voice that sounds natural and engaging.",
"voice": "af_sarah"
}'
| 模型 | 应用 ID | 最佳用途 |
|---|---|---|
| ElevenLabs TTS | elevenlabs/tts | 高品质,22+ 种语音,32 种语言 |
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
| ElevenLabs 语音转换器 | elevenlabs/voice-changer | 转换现有录音语音 |
| Kokoro TTS | infsh/kokoro-tts | 自然,多种语音 |
| DIA | infsh/dia-tts | 对话式,富有表现力 |
| Chatterbox | infsh/chatterbox | 休闲,娱乐 |
| Higgs | infsh/higgs-tts | 专业旁白 |
| VibeVoice | infsh/vibevoice | 情感丰富 |
| 语音 ID | 性别 | 风格 |
|---|---|---|
af_sarah | 女声 | 温暖,友好 |
af_nicole | 女声 | 专业 |
af_sky | 女声 | 年轻 |
am_michael | 男声 | 权威 |
am_adam | 男声 | 对话式 |
am_echo | 男声 | 清晰,中性 |
| 语音 ID | 性别 | 风格 |
|---|---|---|
bf_emma | 女声 | 优雅 |
bf_isabella | 女声 | 温暖 |
bm_george | 男声 | 经典 |
bm_lewis | 男声 | 现代 |
infsh app run infsh/kokoro-tts --input '{
"prompt": "Welcome to our quarterly earnings call. Today we will discuss the financial performance and strategic initiatives for the past quarter.",
"voice": "am_michael",
"speed": 1.0
}'
infsh app run infsh/dia-tts --input '{
"text": "Hey, so I was thinking about that project we discussed. What if we tried a different approach?",
"voice": "conversational"
}'
infsh app run infsh/kokoro-tts --input '{
"prompt": "Chapter One. The morning mist hung low over the valley as Sarah made her way down the winding path. She had been walking for hours.",
"voice": "bf_emma",
"speed": 0.9
}'
infsh app run infsh/kokoro-tts --input '{
"prompt": "Introducing the next generation of productivity. Work smarter, not harder.",
"voice": "af_nicole",
"speed": 1.1
}'
infsh app run infsh/kokoro-tts --input '{
"prompt": "Welcome back to Tech Talk! Im your host, and today we are diving deep into the world of artificial intelligence.",
"voice": "am_adam"
}'
# 生成两个说话者之间的对话
# 说话者 1
infsh app run infsh/kokoro-tts --input '{
"prompt": "Have you seen the latest AI developments? Its incredible how fast things are moving.",
"voice": "am_michael"
}' > speaker1.json
# 说话者 2
infsh app run infsh/kokoro-tts --input '{
"prompt": "I know, right? Just last week I tried that new image generator and was blown away.",
"voice": "af_sarah"
}' > speaker2.json
# 合并对话
infsh app run infsh/media-merger --input '{
"audio_files": ["<speaker1-url>", "<speaker2-url>"],
"crossfade_ms": 300
}'
对于超过 5000 个字符的内容,请分割成块:
# 分块处理长文本
TEXT="Your very long text here..."
# 分割并生成
# 块 1
infsh app run infsh/kokoro-tts --input '{
"prompt": "<chunk-1>",
"voice": "bf_emma"
}' > chunk1.json
# 块 2
infsh app run infsh/kokoro-tts --input '{
"prompt": "<chunk-2>",
"voice": "bf_emma"
}' > chunk2.json
# 合并块
infsh app run infsh/media-merger --input '{
"audio_files": ["<chunk1-url>", "<chunk2-url>"],
"crossfade_ms": 100
}'
# 1. 生成配音
infsh app run infsh/kokoro-tts --input '{
"prompt": "This stunning footage shows the beauty of nature in its purest form.",
"voice": "am_michael"
}' > voiceover.json
# 2. 与视频合并
infsh app run infsh/media-merger --input '{
"video_url": "https://your-video.mp4",
"audio_url": "<voiceover-url>"
}'
# 1. 生成语音
infsh app run infsh/kokoro-tts --input '{
"prompt": "Hi, Im excited to share some updates with you today.",
"voice": "af_sarah"
}' > speech.json
# 2. 使用头像制作动画
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "<speech-url>"
}'
| 速度 | 效果 | 适用于 |
|---|---|---|
| 0.8 | 缓慢,从容 | 有声书,冥想 |
| 0.9 | 稍慢 | 教育,教程 |
| 1.0 | 正常 | 通用 |
| 1.1 | 稍快 | 广告,充满活力 |
| 1.2 | 快速 | 快速公告 |
# 慢速旁白
infsh app run infsh/kokoro-tts --input '{
"prompt": "Take a deep breath. Let yourself relax.",
"voice": "bf_emma",
"speed": 0.8
}'
使用标点符号来控制语音节奏:
| 标点符号 | 效果 |
|---|---|
句号 . | 完全停顿 |
逗号 , | 短暂停顿 |
... | 延长停顿 |
! | 强调 |
? | 疑问语调 |
- | 快速中断 |
infsh app run infsh/kokoro-tts --input '{
"prompt": "Wait... Did you hear that? Something is coming. Something big!",
"voice": "am_adam"
}'
# ElevenLabs TTS (高级,22+ 种语音)
npx skills add inference-sh/skills@elevenlabs-tts
# ElevenLabs 语音转换器 (转换录音)
npx skills add inference-sh/skills@elevenlabs-voice-changer
# 所有 TTS 模型
npx skills add inference-sh/skills@text-to-speech
# 播客创作
npx skills add inference-sh/skills@ai-podcast-creation
# AI 头像
npx skills add inference-sh/skills@ai-avatar-video
# 视频生成
npx skills add inference-sh/skills@ai-video-generation
# 完整平台技能
npx skills add inference-sh/skills@infsh-cli
浏览音频应用:infsh app list --category audio
每周安装量
7.3K
代码仓库
GitHub 星标数
202
首次出现
14 天前
安全审计
安装于
claude-code5.8K
gemini-cli5.2K
codex5.2K
opencode5.2K
amp5.2K
kimi-cli5.2K
Generate natural AI voices via inference.sh CLI.

Requires inference.sh CLI (
infsh). Install instructions
infsh login
# Generate speech
infsh app run infsh/kokoro-tts --input '{
"prompt": "Hello! This is an AI-generated voice that sounds natural and engaging.",
"voice": "af_sarah"
}'
| Model | App ID | Best For |
|---|---|---|
| ElevenLabs TTS | elevenlabs/tts | Premium quality, 22+ voices, 32 languages |
| ElevenLabs Voice Changer | elevenlabs/voice-changer | Transform existing voice recordings |
| Kokoro TTS | infsh/kokoro-tts | Natural, multiple voices |
| DIA | infsh/dia-tts | Conversational, expressive |
| Chatterbox | infsh/chatterbox | Casual, entertainment |
| Higgs | infsh/higgs-tts | Professional narration |
| VibeVoice | infsh/vibevoice | Emotional range |
| Voice ID | Gender | Style |
|---|---|---|
af_sarah | Female | Warm, friendly |
af_nicole | Female | Professional |
af_sky | Female | Youthful |
am_michael | Male | Authoritative |
am_adam | Male | Conversational |
| Voice ID | Gender | Style |
|---|---|---|
bf_emma | Female | Refined |
bf_isabella | Female | Warm |
bm_george | Male | Classic |
bm_lewis | Male | Modern |
infsh app run infsh/kokoro-tts --input '{
"prompt": "Welcome to our quarterly earnings call. Today we will discuss the financial performance and strategic initiatives for the past quarter.",
"voice": "am_michael",
"speed": 1.0
}'
infsh app run infsh/dia-tts --input '{
"text": "Hey, so I was thinking about that project we discussed. What if we tried a different approach?",
"voice": "conversational"
}'
infsh app run infsh/kokoro-tts --input '{
"prompt": "Chapter One. The morning mist hung low over the valley as Sarah made her way down the winding path. She had been walking for hours.",
"voice": "bf_emma",
"speed": 0.9
}'
infsh app run infsh/kokoro-tts --input '{
"prompt": "Introducing the next generation of productivity. Work smarter, not harder.",
"voice": "af_nicole",
"speed": 1.1
}'
infsh app run infsh/kokoro-tts --input '{
"prompt": "Welcome back to Tech Talk! Im your host, and today we are diving deep into the world of artificial intelligence.",
"voice": "am_adam"
}'
# Generate dialogue between two speakers
# Speaker 1
infsh app run infsh/kokoro-tts --input '{
"prompt": "Have you seen the latest AI developments? Its incredible how fast things are moving.",
"voice": "am_michael"
}' > speaker1.json
# Speaker 2
infsh app run infsh/kokoro-tts --input '{
"prompt": "I know, right? Just last week I tried that new image generator and was blown away.",
"voice": "af_sarah"
}' > speaker2.json
# Merge conversation
infsh app run infsh/media-merger --input '{
"audio_files": ["<speaker1-url>", "<speaker2-url>"],
"crossfade_ms": 300
}'
For content over 5000 characters, split into chunks:
# Process long text in chunks
TEXT="Your very long text here..."
# Split and generate
# Chunk 1
infsh app run infsh/kokoro-tts --input '{
"prompt": "<chunk-1>",
"voice": "bf_emma"
}' > chunk1.json
# Chunk 2
infsh app run infsh/kokoro-tts --input '{
"prompt": "<chunk-2>",
"voice": "bf_emma"
}' > chunk2.json
# Merge chunks
infsh app run infsh/media-merger --input '{
"audio_files": ["<chunk1-url>", "<chunk2-url>"],
"crossfade_ms": 100
}'
# 1. Generate voiceover
infsh app run infsh/kokoro-tts --input '{
"prompt": "This stunning footage shows the beauty of nature in its purest form.",
"voice": "am_michael"
}' > voiceover.json
# 2. Merge with video
infsh app run infsh/media-merger --input '{
"video_url": "https://your-video.mp4",
"audio_url": "<voiceover-url>"
}'
# 1. Generate speech
infsh app run infsh/kokoro-tts --input '{
"prompt": "Hi, Im excited to share some updates with you today.",
"voice": "af_sarah"
}' > speech.json
# 2. Animate with avatar
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "<speech-url>"
}'
| Speed | Effect | Use For |
|---|---|---|
| 0.8 | Slow, deliberate | Audiobooks, meditation |
| 0.9 | Slightly slow | Education, tutorials |
| 1.0 | Normal | General purpose |
| 1.1 | Slightly fast | Commercials, energy |
| 1.2 | Fast | Quick announcements |
# Slow narration
infsh app run infsh/kokoro-tts --input '{
"prompt": "Take a deep breath. Let yourself relax.",
"voice": "bf_emma",
"speed": 0.8
}'
Use punctuation to control speech rhythm:
| Punctuation | Effect |
|---|---|
Period . | Full pause |
Comma , | Brief pause |
... | Extended pause |
! | Emphasis |
? | Question intonation |
- | Quick break |
infsh app run infsh/kokoro-tts --input '{
"prompt": "Wait... Did you hear that? Something is coming. Something big!",
"voice": "am_adam"
}'
# ElevenLabs TTS (premium, 22+ voices)
npx skills add inference-sh/skills@elevenlabs-tts
# ElevenLabs voice changer (transform recordings)
npx skills add inference-sh/skills@elevenlabs-voice-changer
# All TTS models
npx skills add inference-sh/skills@text-to-speech
# Podcast creation
npx skills add inference-sh/skills@ai-podcast-creation
# AI avatars
npx skills add inference-sh/skills@ai-avatar-video
# Video generation
npx skills add inference-sh/skills@ai-video-generation
# Full platform skill
npx skills add inference-sh/skills@infsh-cli
Browse audio apps: infsh app list --category audio
Weekly Installs
7.3K
Repository
GitHub Stars
202
First Seen
14 days ago
Security Audits
Gen Agent Trust HubPassSocketWarnSnykPass
Installed on
claude-code5.8K
gemini-cli5.2K
codex5.2K
opencode5.2K
amp5.2K
kimi-cli5.2K
React 组合模式指南:Vercel 组件架构最佳实践,提升代码可维护性
102,200 周安装
am_echo |
| Male |
| Clear, neutral |