alicloud-ai-audio-tts by cinience/alicloud-skills
npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-ttsCategory: provider
mkdir -p output/alicloud-ai-audio-tts
python -m py_compile skills/ai/audio/alicloud-ai-audio-tts/scripts/generate_tts.py && echo "py_compile_ok" > output/alicloud-ai-audio-tts/validate.txt
通过标准:命令退出码为 0 且生成 output/alicloud-ai-audio-tts/validate.txt 文件。
output/alicloud-ai-audio-tts/ 目录。使用以下推荐模型之一:
qwen3-tts-flashqwen3-tts-instruct-flashqwen3-tts-instruct-flash-2026-01-26广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
安装 SDK(建议在虚拟环境中安装以避免 PEP 668 限制):
python3 -m venv .venv . .venv/bin/activate python -m pip install dashscope
在环境变量中设置 DASHSCOPE_API_KEY,或在 ~/.alibabacloud/credentials 文件中添加 dashscope_api_key(环境变量优先级更高)。
text (字符串,必需)voice (字符串,必需)language_type (字符串,可选;默认值 Auto)instruction (字符串,可选;推荐用于 instruct 模型)stream (布尔值,可选;默认值 false)audio_url (字符串,当 stream=false 时)audio_base64_pcm (字符串,当 stream=true 时)sample_rate (整数,24000)format (字符串,根据模式为 wav 或 pcm)import os
import dashscope
# 推荐使用环境变量进行认证:export DASHSCOPE_API_KEY=...
# 或者使用 ~/.alibabacloud/credentials 文件,在 [default] 下设置 dashscope_api_key。
# 北京区域;如需使用新加坡区域,请设置:https://dashscope-intl.aliyuncs.com/api/v1
dashscope.base_http_api_url = "https://dashscope.aliyuncs.com/api/v1"
text = "Hello, this is a short voice line."
response = dashscope.MultiModalConversation.call(
model="qwen3-tts-instruct-flash",
api_key=os.getenv("DASHSCOPE_API_KEY"),
text=text,
voice="Cherry",
language_type="English",
instruction="Warm and calm tone, slightly slower pace.",
stream=False,
)
audio_url = response.output.audio.url
print(audio_url)
stream=True 返回 Base64 编码的 24kHz PCM 数据块。finish_reason == "stop"。language_type 以提高发音准确性。instruction。(text, voice, language_type) 进行缓存以避免重复费用。output/alicloud-ai-audio-tts/audio/OUTPUT_DIR 环境变量覆盖基础目录。references/api_reference.md 包含参数映射和流式传输示例。
实时模式由 skills/ai/audio/alicloud-ai-audio-tts-realtime/ 提供。
语音克隆/设计由 skills/ai/audio/alicloud-ai-audio-tts-voice-clone/ 和 skills/ai/audio/alicloud-ai-audio-tts-voice-design/ 提供。
来源列表:references/sources.md
每周安装量
183
代码仓库
GitHub 星标数
340
首次出现
2026年2月7日
安全审计
安装于
gemini-cli182
github-copilot182
codex182
kimi-cli182
amp182
opencode182
Category: provider
mkdir -p output/alicloud-ai-audio-tts
python -m py_compile skills/ai/audio/alicloud-ai-audio-tts/scripts/generate_tts.py && echo "py_compile_ok" > output/alicloud-ai-audio-tts/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-audio-tts/validate.txt is generated.
output/alicloud-ai-audio-tts/.Use one of the recommended models:
qwen3-tts-flashqwen3-tts-instruct-flashqwen3-tts-instruct-flash-2026-01-26Install SDK (recommended in a venv to avoid PEP 668 limits):
python3 -m venv .venv . .venv/bin/activate python -m pip install dashscope
Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials (env takes precedence).
text (string, required)voice (string, required)language_type (string, optional; default Auto)instruction (string, optional; recommended for instruct models)stream (bool, optional; default false)audio_url (string, when stream=false)audio_base64_pcm (string, when stream=true)sample_rate (int, 24000)format (string, wav or pcm depending on mode)import os
import dashscope
# Prefer env var for auth: export DASHSCOPE_API_KEY=...
# Or use ~/.alibabacloud/credentials with dashscope_api_key under [default].
# Beijing region; for Singapore use: https://dashscope-intl.aliyuncs.com/api/v1
dashscope.base_http_api_url = "https://dashscope.aliyuncs.com/api/v1"
text = "Hello, this is a short voice line."
response = dashscope.MultiModalConversation.call(
model="qwen3-tts-instruct-flash",
api_key=os.getenv("DASHSCOPE_API_KEY"),
text=text,
voice="Cherry",
language_type="English",
instruction="Warm and calm tone, slightly slower pace.",
stream=False,
)
audio_url = response.output.audio.url
print(audio_url)
stream=True returns Base64-encoded PCM chunks at 24kHz.finish_reason == "stop" when the stream ends.language_type consistent with the text to improve pronunciation.instruction only when you need explicit style/tone control.(text, voice, language_type) to avoid repeat costs.output/alicloud-ai-audio-tts/audio/OUTPUT_DIR.references/api_reference.md for parameter mapping and streaming example.
Realtime mode is provided by skills/ai/audio/alicloud-ai-audio-tts-realtime/.
Voice cloning/design are provided by skills/ai/audio/alicloud-ai-audio-tts-voice-clone/ and skills/ai/audio/alicloud-ai-audio-tts-voice-design/.
Source list: references/sources.md
Weekly Installs
183
Repository
GitHub Stars
340
First Seen
Feb 7, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
gemini-cli182
github-copilot182
codex182
kimi-cli182
amp182
opencode182
Azure 配额管理指南:服务限制、容量验证与配额增加方法
79,700 周安装
竞争对手研究指南:SEO、内容、反向链接与定价分析工具
231 周安装
Azure 工作负载自动升级评估工具 - 支持 Functions、App Service 计划与 SKU 迁移
231 周安装
Kaizen持续改进方法论:软件开发中的渐进式优化与防错设计实践指南
231 周安装
软件UI/UX设计指南:以用户为中心的设计原则、WCAG可访问性与平台规范
231 周安装
Apify 网络爬虫和自动化平台 - 无需编码抓取亚马逊、谷歌、领英等网站数据
231 周安装
llama.cpp 中文指南:纯 C/C++ LLM 推理,CPU/非 NVIDIA 硬件优化部署
231 周安装