alicloud-ai-audio-cosyvoice-voice-clone by cinience/alicloud-skills
npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-audio-cosyvoice-voice-cloneCategory: provider
使用 CosyVoice 语音注册 API,通过公开参考音频创建克隆语音。
使用 model="voice-enrollment" 和以下 target_model 值之一:
cosyvoice-v3.5-pluscosyvoice-v3.5-flashcosyvoice-v3-pluscosyvoice-v3-flashcosyvoice-v2本仓库推荐的默认值:
target_model="cosyvoice-v3.5-plus"广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
cosyvoice-v3.5-plus 和 cosyvoice-v3.5-flash 仅在中国大陆部署模式(北京端点)下可用。cosyvoice-v3-plus 和 cosyvoice-v3-flash 不支持语音克隆/设计。target_model 必须与后续语音合成时使用的模型匹配,否则合成将失败。https://dashscope.aliyuncs.com/api/v1/services/audio/tts/customizationhttps://dashscope-intl.aliyuncs.com/api/v1/services/audio/tts/customizationDASHSCOPE_API_KEY,或在 ~/.alibabacloud/credentials 中添加 dashscope_api_key。model (字符串, 可选): 固定为 voice-enrollmenttarget_model (字符串, 可选): 默认 cosyvoice-v3.5-plusprefix (字符串, 必需): 仅限字母/数字,最多 10 个字符voice_sample_url (字符串, 必需): 公开音频 URLlanguage_hints (数组[字符串], 可选): 仅使用第一项max_prompt_audio_length (浮点数, 可选): 仅适用于 cosyvoice-v3.5-plus, cosyvoice-v3.5-flash, cosyvoice-v3-flashenable_preprocess (布尔值, 可选): 仅适用于 cosyvoice-v3.5-plus, cosyvoice-v3.5-flash, cosyvoice-v3-flashvoice_id (字符串): 在后续 TTS 调用中将其用作 voice 参数request_id (字符串)usage.count (数字, 可选)language_hints=["zh"];后续在合成中通过文本或 instruct 控制方言风格。cosyvoice-v3.5-plus,支持的 language_hints 包括 zh, en, fr, de, ja, ko, ru, pt, th, id, vi。准备一个标准化的请求 JSON:
python skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/scripts/prepare_cosyvoice_clone_request.py \
--target-model cosyvoice-v3.5-plus \
--prefix myvoice \
--voice-sample-url https://example.com/voice.wav \
--language-hint zh
mkdir -p output/alicloud-ai-audio-cosyvoice-voice-clone
for f in skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/scripts/*.py; do
python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-cosyvoice-voice-clone/validate.txt
通过标准:命令退出码为 0 且生成 output/alicloud-ai-audio-cosyvoice-voice-clone/validate.txt 文件。
output/alicloud-ai-audio-cosyvoice-voice-clone/ 目录下。target_model、prefix 和样本 URL。references/api_reference.mdreferences/sources.md每周安装量
85
仓库
GitHub 星标数
364
首次出现
13 天前
安全审计
安装于
github-copilot83
codex83
amp83
cline83
kimi-cli83
gemini-cli83
Category: provider
Use the CosyVoice voice enrollment API to create cloned voices from public reference audio.
Use model="voice-enrollment" and one of these target_model values:
cosyvoice-v3.5-pluscosyvoice-v3.5-flashcosyvoice-v3-pluscosyvoice-v3-flashcosyvoice-v2Recommended default in this repo:
target_model="cosyvoice-v3.5-plus"cosyvoice-v3.5-plus and cosyvoice-v3.5-flash are available only in China mainland deployment mode (Beijing endpoint).cosyvoice-v3-plus and cosyvoice-v3-flash do not support voice clone/design.target_model used during enrollment must match the model used later in speech synthesis, otherwise synthesis fails.https://dashscope.aliyuncs.com/api/v1/services/audio/tts/customizationhttps://dashscope-intl.aliyuncs.com/api/v1/services/audio/tts/customizationDASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.model (string, optional): fixed to voice-enrollmenttarget_model (string, optional): default cosyvoice-v3.5-plusprefix (string, required): letters/digits only, max 10 charsvoice_sample_url (string, required): public audio URLlanguage_hints (array[string], optional): only first item is usedmax_prompt_audio_length (float, optional): only for cosyvoice-v3.5-plus, cosyvoice-v3.5-flash, voice_id (string): use this as the voice parameter in later TTS callsrequest_id (string)usage.count (number, optional)language_hints=["zh"]; control dialect style later in synthesis via text or instruct.cosyvoice-v3.5-plus, supported language_hints include zh, en, fr, de, ja, ko, ru, , , , .Prepare a normalized request JSON:
python skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/scripts/prepare_cosyvoice_clone_request.py \
--target-model cosyvoice-v3.5-plus \
--prefix myvoice \
--voice-sample-url https://example.com/voice.wav \
--language-hint zh
mkdir -p output/alicloud-ai-audio-cosyvoice-voice-clone
for f in skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/scripts/*.py; do
python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-cosyvoice-voice-clone/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-audio-cosyvoice-voice-clone/validate.txt is generated.
output/alicloud-ai-audio-cosyvoice-voice-clone/.target_model, prefix, and sample URL in the evidence file.references/api_reference.mdreferences/sources.mdWeekly Installs
85
Repository
GitHub Stars
364
First Seen
13 days ago
Security Audits
Gen Agent Trust HubPassSocketPassSnykWarn
Installed on
github-copilot83
codex83
amp83
cline83
kimi-cli83
gemini-cli83
Azure 配额管理指南:服务限制、容量验证与配额增加方法
116,900 周安装
cosyvoice-v3-flashenable_preprocess (bool, optional): only for cosyvoice-v3.5-plus, cosyvoice-v3.5-flash, cosyvoice-v3-flashptthidvi