Alibaba Cloud CosyVoice Voice Cloning Tutorial: A Guide to AI Voice Customization and the TTS Synthesis API

alicloud-ai-audio-cosyvoice-voice-clone by cinience/alicloud-skills

Install command

npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-cosyvoice-voice-clone

Category: provider

Model Studio CosyVoice Voice Clone

Use the CosyVoice voice enrollment API to create cloned voices from public reference audio.

Critical model names

Use model="voice-enrollment" and one of these target_model values:

cosyvoice-v3.5-plus
cosyvoice-v3.5-flash
cosyvoice-v3-plus
cosyvoice-v3-flash
cosyvoice-v2

Recommended default in this repo:

target_model="cosyvoice-v3.5-plus"

Region and compatibility

cosyvoice-v3.5-plus and cosyvoice-v3.5-flash are available only in China mainland deployment mode (Beijing endpoint).
In international deployment mode (Singapore endpoint), cosyvoice-v3-plus and cosyvoice-v3-flash do not support voice cloning or voice design.
The target_model used during enrollment must match the model used later in speech synthesis, otherwise synthesis fails.
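
The three rules above can be checked before any API call. The following is a minimal sketch, not part of the skill: the function name and the mode labels "mainland" and "international" are hypothetical, and the check encodes only what the rules state. International support for cosyvoice-v2 is not covered by those rules, so the sketch does not judge it.

```python
from typing import List

# Rules transcribed from the region notes; mode labels are hypothetical.
MAINLAND_ONLY = {"cosyvoice-v3.5-plus", "cosyvoice-v3.5-flash"}
NO_CLONE_INTERNATIONAL = {"cosyvoice-v3-plus", "cosyvoice-v3-flash"}


def check_compatibility(target_model: str, mode: str, synthesis_model: str) -> List[str]:
    """Return a list of problems; an empty list means no stated rule is violated."""
    problems = []
    if mode not in ("mainland", "international"):
        problems.append(f"unknown deployment mode: {mode!r}")
    if mode == "international":
        if target_model in MAINLAND_ONLY:
            problems.append(f"{target_model} is available only in China mainland mode")
        if target_model in NO_CLONE_INTERNATIONAL:
            problems.append(f"{target_model} does not support voice cloning in international mode")
    if target_model != synthesis_model:
        problems.append("target_model must match the model used for synthesis")
    return problems


if __name__ == "__main__":
    for issue in check_compatibility("cosyvoice-v3-flash", "international", "cosyvoice-v2"):
        print("problem:", issue)
```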

Endpoint

Domestic: https://dashscope.aliyuncs.com/api/v1/services/audio/tts/customization
International: https://dashscope-intl.aliyuncs.com/api/v1/services/audio/tts/customization
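
A small lookup keeps the two URLs in one place. This is a sketch only; the mode labels and the function name are hypothetical, and the URLs are copied from the list above.

```python
# Endpoint URLs copied from the list above; the dictionary keys are hypothetical labels.
ENDPOINTS = {
    "mainland": "https://dashscope.aliyuncs.com/api/v1/services/audio/tts/customization",
    "international": "https://dashscope-intl.aliyuncs.com/api/v1/services/audio/tts/customization",
}


def endpoint_for(mode: str) -> str:
    """Return the customization endpoint for a deployment mode, or raise ValueError."""
    try:
        return ENDPOINTS[mode]
    except KeyError:
        raise ValueError(f"unknown deployment mode: {mode!r}") from None


if __name__ == "__main__":
    print(endpoint_for("mainland"))
```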

Prerequisites

Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
Provide a public audio URL for the enrollment sample.
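
The key lookup order above could be sketched as follows. The layout of ~/.alibabacloud/credentials is not described here, so this sketch assumes an INI-style file with a [default] section; that layout, the profile name, and the function name are assumptions, not confirmed behavior of the skill.

```python
import configparser
import os
from pathlib import Path
from typing import Optional


def resolve_api_key(credentials_path: Optional[Path] = None,
                    profile: str = "default") -> Optional[str]:
    """Return the API key from the environment, else from the credentials file."""
    key = os.environ.get("DASHSCOPE_API_KEY")
    if key:
        return key
    path = credentials_path or Path.home() / ".alibabacloud" / "credentials"
    parser = configparser.ConfigParser()
    try:
        # read() silently ignores a missing file.
        parser.read(path, encoding="utf-8")
    except configparser.Error:
        # ASSUMPTION: INI layout. A file in another format is treated as "no key".
        return None
    return parser.get(profile, "dashscope_api_key", fallback=None)


if __name__ == "__main__":
    print("key found" if resolve_api_key() else "no key configured")
```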

Normalized interface (cosyvoice.voice_clone)

Request

model (string, optional): fixed to voice-enrollment
target_model (string, optional): default cosyvoice-v3.5-plus
prefix (string, required): letters/digits only, max 10 chars
voice_sample_url (string, required): public audio URL
language_hints (array[string], optional): only first item is used
max_prompt_audio_length (float, optional): only for cosyvoice-v3.5-plus, cosyvoice-v3.5-flash,
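
The request fields above can be validated locally before enrollment. This is a hedged sketch, not the repository's helper script: the function name is hypothetical, "letters/digits" is assumed to mean ASCII, and max_prompt_audio_length is passed through without a per-model check.

```python
import re
from typing import Any, Dict, List, Optional

# ASSUMPTION: "letters/digits only" means ASCII letters and digits.
PREFIX_RE = re.compile(r"[A-Za-z0-9]{1,10}")


def build_clone_request(prefix: str,
                        voice_sample_url: str,
                        target_model: str = "cosyvoice-v3.5-plus",
                        language_hints: Optional[List[str]] = None,
                        max_prompt_audio_length: Optional[float] = None) -> Dict[str, Any]:
    """Build a normalized cosyvoice.voice_clone request dict."""
    if not PREFIX_RE.fullmatch(prefix):
        raise ValueError("prefix must be 1-10 letters or digits")
    if not voice_sample_url.startswith(("http://", "https://")):
        raise ValueError("voice_sample_url must be a public http(s) URL")
    request: Dict[str, Any] = {
        "model": "voice-enrollment",  # fixed value
        "target_model": target_model,
        "prefix": prefix,
        "voice_sample_url": voice_sample_url,
    }
    if language_hints:
        # Only the first hint is used, so drop the rest up front.
        request["language_hints"] = language_hints[:1]
    if max_prompt_audio_length is not None:
        request["max_prompt_audio_length"] = float(max_prompt_audio_length)
    return request


if __name__ == "__main__":
    print(build_clone_request("myvoice", "https://example.com/voice.wav",
                              language_hints=["zh"]))
```

Truncating language_hints to one item makes the "only first item is used" rule visible in the saved request rather than leaving it implicit.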

Response

voice_id (string): use this as the voice parameter in later TTS calls
request_id (string)
usage.count (number, optional)
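
A normalized response with these fields could be read as shown below. This sketch assumes the response arrives as a plain dict with usage nested as a sub-dict; the class and function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Mapping, Optional


@dataclass(frozen=True)
class CloneResult:
    voice_id: str
    request_id: str
    usage_count: Optional[float] = None


def parse_clone_response(payload: Mapping[str, Any]) -> CloneResult:
    """Extract voice_id, request_id and the optional usage.count from a response dict."""
    voice_id = payload.get("voice_id")
    if not voice_id:
        raise ValueError("response has no voice_id")
    usage = payload.get("usage") or {}  # usage is optional
    return CloneResult(
        voice_id=voice_id,
        request_id=payload.get("request_id", ""),
        usage_count=usage.get("count"),
    )


if __name__ == "__main__":
    sample = {"voice_id": "example-voice-id", "request_id": "example-request-id"}
    result = parse_clone_response(sample)
    # Pass result.voice_id as the voice parameter in later TTS calls.
    print(result.voice_id)
```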

Operational guidance

For Chinese dialect reference audio, keep language_hints=["zh"]; control dialect style later in synthesis via text or instruct.
For cosyvoice-v3.5-plus, supported language_hints include zh, en, fr, de, ja, ko, and ru, among others.

Local helper script

Prepare a normalized request JSON:

python skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/scripts/prepare_cosyvoice_clone_request.py \
  --target-model cosyvoice-v3.5-plus \
  --prefix myvoice \
  --voice-sample-url https://example.com/voice.wav \
  --language-hint zh

Validation

mkdir -p output/alicloud-ai-audio-cosyvoice-voice-clone
for f in skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-cosyvoice-voice-clone/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-audio-cosyvoice-voice-clone/validate.txt is generated.

Output And Evidence

Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-cosyvoice-voice-clone/.
Include target_model, prefix, and sample URL in the evidence file.
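
One way to record that evidence is a small JSON file. This is a sketch under assumptions: the file name evidence.json and the function name are hypothetical, and only the output directory comes from the text above.

```python
import json
from pathlib import Path
from typing import Any, Mapping

OUTPUT_DIR = Path("output/alicloud-ai-audio-cosyvoice-voice-clone")


def write_evidence(target_model: str,
                   prefix: str,
                   voice_sample_url: str,
                   response_summary: Mapping[str, Any],
                   output_dir: Path = OUTPUT_DIR) -> Path:
    """Write target_model, prefix, sample URL and a response summary as JSON."""
    output_dir.mkdir(parents=True, exist_ok=True)
    path = output_dir / "evidence.json"  # hypothetical file name
    record = {
        "target_model": target_model,
        "prefix": prefix,
        "voice_sample_url": voice_sample_url,
        "response": dict(response_summary),
    }
    path.write_text(json.dumps(record, indent=2, ensure_ascii=False) + "\n",
                    encoding="utf-8")
    return path


if __name__ == "__main__":
    print(write_evidence("cosyvoice-v3.5-plus", "myvoice",
                         "https://example.com/voice.wav",
                         {"voice_id": "example-voice-id"}))
```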

References

references/api_reference.md
references/sources.md
