重要前提
安装AI Skills的关键前提是:必须科学上网,且开启TUN模式,这一点至关重要,直接决定安装能否顺利完成,在此郑重提醒三遍:科学上网,科学上网,科学上网。查看完整安装教程 →
google-tts by sanjay3290/ai-skills
npx skills add https://github.com/sanjay3290/ai-skills --skill google-tts使用 Google Cloud TTS API 将文本和文档转换为音频。支持超过 40 种语言的 Neural2、WaveNet、Studio 和 Standard 语音。
通过 GOOGLE_TTS_API_KEY 环境变量或 skills/google-tts/config.json 文件(内容为 {"api_key": "..."})提供 API 密钥。处理多分块文档需要 ffmpeg。可选:为处理 PDF/DOCX 文件,可执行 pip install PyPDF2 python-docx。
python skills/google-tts/scripts/google_tts.py voices --language en-US --type Neural2
python skills/google-tts/scripts/google_tts.py voices --json
# 从文本或文档(PDF, DOCX, MD, TXT)转换
python skills/google-tts/scripts/google_tts.py tts --text "Hello world" --output ~/Downloads/hello.mp3
python skills/google-tts/scripts/google_tts.py tts --file /path/to/doc.pdf --output ~/Downloads/narration.mp3
# 使用语音、语速、音高、编码选项
python skills/google-tts/scripts/google_tts.py tts --file doc.md --voice en-US-Neural2-F --rate 0.9 --encoding MP3 --output ~/Downloads/out.mp3
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
接收一个包含交替发言者的 JSON 脚本,使用不同的语音合成每个部分。
[
{"speaker": "host1", "text": "Welcome to our podcast!"},
{"speaker": "host2", "text": "Thanks for having me..."}
]
python skills/google-tts/scripts/google_tts.py podcast --script /tmp/script.json --output ~/Downloads/podcast.mp3
python skills/google-tts/scripts/google_tts.py podcast --script /tmp/script.json --voice1 en-US-Neural2-J --voice2 en-US-Neural2-H --rate 0.9 --output ~/Downloads/podcast.mp3
--file 参数。对于生成的内容,首先将清晰的散文写入 /tmp/tts_input.md。en-US-Neural2-D(男声)或 en-US-Neural2-F(女声)。为获得最佳质量/成本平衡,请使用 Neural2。python skills/google-tts/scripts/google_tts.py tts --file /tmp/tts_input.md --output ~/Downloads/recording.mp3~/Downloads/。python skills/google-tts/scripts/extract.py /path/to/document.pdf/tmp/podcast_script.jsonpython skills/google-tts/scripts/google_tts.py podcast --script /tmp/podcast_script.json --output ~/Downloads/podcast.mp3每周安装次数
55
代码仓库
GitHub 星标数
175
首次出现
2026年2月24日
安全审计
安装于
gemini-cli52
amp52
github-copilot52
codex52
kimi-cli52
opencode52
Converts text and documents into audio using Google Cloud TTS API. Supports Neural2, WaveNet, Studio, and Standard voices across 40+ languages.
API key via GOOGLE_TTS_API_KEY env var or skills/google-tts/config.json with {"api_key": "..."}. Requires ffmpeg for multi-chunk documents. Optional: pip install PyPDF2 python-docx for PDF/DOCX.
python skills/google-tts/scripts/google_tts.py voices --language en-US --type Neural2
python skills/google-tts/scripts/google_tts.py voices --json
# From text or document (PDF, DOCX, MD, TXT)
python skills/google-tts/scripts/google_tts.py tts --text "Hello world" --output ~/Downloads/hello.mp3
python skills/google-tts/scripts/google_tts.py tts --file /path/to/doc.pdf --output ~/Downloads/narration.mp3
# With voice, rate, pitch, encoding options
python skills/google-tts/scripts/google_tts.py tts --file doc.md --voice en-US-Neural2-F --rate 0.9 --encoding MP3 --output ~/Downloads/out.mp3
Takes a JSON script with alternating speakers, synthesizes each with a different voice.
[
{"speaker": "host1", "text": "Welcome to our podcast!"},
{"speaker": "host2", "text": "Thanks for having me..."}
]
python skills/google-tts/scripts/google_tts.py podcast --script /tmp/script.json --output ~/Downloads/podcast.mp3
python skills/google-tts/scripts/google_tts.py podcast --script /tmp/script.json --voice1 en-US-Neural2-J --voice2 en-US-Neural2-H --rate 0.9 --output ~/Downloads/podcast.mp3
--file. For generated content, write clean prose to /tmp/tts_input.md first.en-US-Neural2-D (male) or en-US-Neural2-F (female). Use Neural2 for best quality/cost balance.python skills/google-tts/scripts/google_tts.py tts --file /tmp/tts_input.md --output ~/Downloads/recording.mp3~/Downloads/.python skills/google-tts/scripts/extract.py /path/to/document.pdf/tmp/podcast_script.jsonpython skills/google-tts/scripts/google_tts.py podcast --script /tmp/podcast_script.json --output ~/Downloads/podcast.mp3Weekly Installs
55
Repository
GitHub Stars
175
First Seen
Feb 24, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
gemini-cli52
amp52
github-copilot52
codex52
kimi-cli52
opencode52
AI界面设计评审工具 - 全面评估UI/UX设计质量、检测AI生成痕迹与优化用户体验
58,500 周安装