⚠️

重要前提

安装AI Skills的关键前提是：必须科学上网，且开启TUN模式，这一点至关重要，直接决定安装能否顺利完成，在此郑重提醒三遍：科学上网，科学上网，科学上网。查看完整安装教程 →

ElevenLabs API 使用指南：通过 curl 调用实现高质量 AI 文本转语音

elevenlabs by vm0-ai/vm0-skills

60 周安装量

52 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/vm0-ai/vm0-skills --skill elevenlabs

AI/机器学习音频处理 API

🇨🇳中文介绍

ElevenLabs API

通过直接的 curl 调用使用 ElevenLabs API，从文本生成逼真的 AI 语音。

官方文档：https://elevenlabs.io/docs/api-reference

使用时机

在以下场景中使用此技能：

使用高质量的 AI 语音将文本转换为语音
列出可用语音，为您的用例找到合适的语音
流式传输音频输出以实现实时播放
为视频、播客或无障碍功能生成语音旁白

先决条件

在 ElevenLabs 注册并创建账户
前往您的个人资料设置并生成一个 API 密钥
将您的 API 密钥存储在环境变量 ELEVENLABS_API_KEY 中

export ELEVENLABS_API_KEY="your-api-key"

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

1. 列出可用语音

获取您的账户可用的所有语音：

curl -s -X GET "https://api.elevenlabs.io/v1/voices" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" | jq '.voices[] | {voice_id, name, category}'

这将返回文本转语音所需的语音 ID。常见的语音类别：

premade：ElevenLabs 默认语音
cloned：您克隆的语音
generated：AI 设计的语音

2. 获取语音详情

获取特定语音的详细信息。将 <your-voice-id> 替换为实际的语音 ID：

curl -s -X GET "https://api.elevenlabs.io/v1/voices/<your-voice-id>" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)"

3. 列出可用模型

获取所有可用的 TTS 模型：

curl -s -X GET "https://api.elevenlabs.io/v1/models" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" | jq '.[] | {model_id, name, can_do_text_to_speech}'

eleven_multilingual_v2：最佳质量，支持 29 种语言
eleven_flash_v2_5：低延迟，适合实时应用
eleven_v3：最新模型（alpha 版）

4. 文本转语音（保存到文件）

将文本转换为语音并保存为 MP3。将 <your-voice-id> 替换为从“列出语音”端点获取的实际语音 ID：

写入 /tmp/elevenlabs_request.json：

{
  "text": "Hello! This is a test of the ElevenLabs text to speech API.",
  "model_id": "eleven_multilingual_v2",
  "voice_settings": {
    "stability": 0.5,
    "similarity_boost": 0.75
  }
}

curl -s -X POST "https://api.elevenlabs.io/v1/text-to-speech/<your-voice-id>" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" --header "Content-Type: application/json" --header "Accept: audio/mpeg" -d @/tmp/elevenlabs_request.json --output speech.mp3

stability (0.0-1.0)：值越高越稳定一致，值越低表现力越强
similarity_boost (0.0-1.0)：值越高越接近原始语音

5. 流式文本转语音

流式传输音频以实现实时播放。将 <your-voice-id> 替换为实际的语音 ID：

写入 /tmp/elevenlabs_request.json：

{
  "text": "This audio is being streamed in real-time.",
  "model_id": "eleven_flash_v2_5"
}

curl -s -X POST "https://api.elevenlabs.io/v1/text-to-speech/<your-voice-id>/stream" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" --header "Content-Type: application/json" --header "Accept: audio/mpeg" -d @/tmp/elevenlabs_request.json --output streamed.mp3

6. 获取用户订阅信息

检查您的使用情况和字符限制：

curl -s -X GET "https://api.elevenlabs.io/v1/user/subscription" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" | jq '{character_count, character_limit, tier}'

您可以通过 output_format 查询参数指定不同的输出格式。将 <your-voice-id> 替换为实际的语音 ID：

写入 /tmp/elevenlabs_request.json：

{
  "text": "Hello world",
  "model_id": "eleven_multilingual_v2"
}

curl -s -X POST "https://api.elevenlabs.io/v1/text-to-speech/<your-voice-id>?output_format=pcm_16000" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" --header "Content-Type: application/json" -d @/tmp/elevenlabs_request.json --output speech.pcm

mp3_44100_192 (默认)：44.1kHz，192kbps 的 MP3
mp3_44100_128：44.1kHz，128kbps 的 MP3
pcm_16000：16kHz 的 PCM
pcm_22050：22.05kHz 的 PCM
pcm_24000：24kHz 的 PCM

选择合适的模型：低延迟场景使用 eleven_flash_v2_5，追求最佳质量使用 eleven_multilingual_v2
监控使用量：检查订阅端点，避免超出字符限制
尝试调整语音设置：调整稳定性和相似度增强以获得不同效果
长文本使用流式传输：流式端点更适合实时应用
缓存语音 ID：存储常用语音 ID，避免重复的 API 调用

2026 年 1 月 24 日

🇺🇸English

ElevenLabs API

Use the ElevenLabs API via direct curl calls to generate realistic AI speech from text.

Official docs: https://elevenlabs.io/docs/api-reference

When to Use

Use this skill when you need to:

Convert text to speech with high-quality AI voices
List available voices to find the right voice for your use case
Stream audio output for real-time playback
Generate voiceovers for videos, podcasts, or accessibility

Prerequisites

Sign up at ElevenLabs and create an account
Go to your profile settings and generate an API key
Store your API key in the environment variable ELEVENLABS_API_KEY

export ELEVENLABS_API_KEY="your-api-key"

API Limits

Free tier: limited characters per month
API key is passed via the xi-api-key header

How to Use

All examples below assume you have ELEVENLABS_API_KEY set.

The base URL for the ElevenLabs API is:

https://api.elevenlabs.io/v1

1. List Available Voices

Get all voices available to your account:

curl -s -X GET "https://api.elevenlabs.io/v1/voices" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" | jq '.voices[] | {voice_id, name, category}'

This returns voice IDs needed for text-to-speech. Common voice categories:

premade: ElevenLabs default voices
cloned: Your cloned voices
generated: AI-designed voices

2. Get Voice Details

Get detailed information about a specific voice. Replace <your-voice-id> with an actual voice ID:

curl -s -X GET "https://api.elevenlabs.io/v1/voices/<your-voice-id>" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)"

3. List Available Models

Get all available TTS models:

curl -s -X GET "https://api.elevenlabs.io/v1/models" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" | jq '.[] | {model_id, name, can_do_text_to_speech}'

Common models:

eleven_multilingual_v2: Best quality, supports 29 languages
eleven_flash_v2_5: Low latency, good for real-time
eleven_v3: Latest model (alpha)

4. Text to Speech (Save to File)

Convert text to speech and save as MP3. Replace <your-voice-id> with an actual voice ID from the list voices endpoint:

Write to /tmp/elevenlabs_request.json:

{
  "text": "Hello! This is a test of the ElevenLabs text to speech API.",
  "model_id": "eleven_multilingual_v2",
  "voice_settings": {
    "stability": 0.5,
    "similarity_boost": 0.75
  }
}

Then run:

curl -s -X POST "https://api.elevenlabs.io/v1/text-to-speech/<your-voice-id>" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" --header "Content-Type: application/json" --header "Accept: audio/mpeg" -d @/tmp/elevenlabs_request.json --output speech.mp3

Voice settings:

stability (0.0-1.0): Higher = more consistent, lower = more expressive
similarity_boost (0.0-1.0): Higher = closer to original voice

5. Text to Speech with Streaming

Stream audio for real-time playback. Replace <your-voice-id> with an actual voice ID:

Write to /tmp/elevenlabs_request.json:

{
  "text": "This audio is being streamed in real-time.",
  "model_id": "eleven_flash_v2_5"
}

Then run:

curl -s -X POST "https://api.elevenlabs.io/v1/text-to-speech/<your-voice-id>/stream" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" --header "Content-Type: application/json" --header "Accept: audio/mpeg" -d @/tmp/elevenlabs_request.json --output streamed.mp3

6. Get User Subscription Info

Check your usage and character limits:

curl -s -X GET "https://api.elevenlabs.io/v1/user/subscription" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" | jq '{character_count, character_limit, tier}'

Output Formats

You can specify different output formats via the output_format query parameter. Replace <your-voice-id> with an actual voice ID:

Write to /tmp/elevenlabs_request.json:

{
  "text": "Hello world",
  "model_id": "eleven_multilingual_v2"
}

Then run:

curl -s -X POST "https://api.elevenlabs.io/v1/text-to-speech/<your-voice-id>?output_format=pcm_16000" --header "xi-api-key: $(printenv ELEVENLABS_API_KEY)" --header "Content-Type: application/json" -d @/tmp/elevenlabs_request.json --output speech.pcm

Available formats:

mp3_44100_192 (default): MP3 at 44.1kHz, 192kbps
mp3_44100_128: MP3 at 44.1kHz, 128kbps
pcm_16000: PCM at 16kHz
pcm_22050: PCM at 22.05kHz
pcm_24000: PCM at 24kHz

Guidelines

Choose the right model : Use eleven_flash_v2_5 for low latency, eleven_multilingual_v2 for best quality
Monitor usage : Check subscription endpoint to avoid exceeding character limits
Experiment with voice settings : Adjust stability and similarity_boost for different effects
Use streaming for long text : Stream endpoint is better for real-time applications
Cache voice IDs : Store frequently used voice IDs to avoid repeated API calls

Weekly Installs

Repository

vm0-ai/vm0-skills

GitHub Stars

First Seen

Jan 24, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykPass

Installed on

gemini-cli52

opencode51

codex50

cursor49

amp49

cline49

SoulTrace 人格评估 API - 基于五色心理模型的贝叶斯自适应测试

56,700 周安装

ElevenLabs API 使用指南：通过 curl 调用实现高质量 AI 文本转语音

🇨🇳中文介绍

ElevenLabs API

使用时机

先决条件

相关 Skills

API 限制

使用方法

1. 列出可用语音

2. 获取语音详情

3. 列出可用模型

4. 文本转语音（保存到文件）

5. 流式文本转语音

6. 获取用户订阅信息

输出格式

指南

🇺🇸English

ElevenLabs API

When to Use

Prerequisites

API Limits

How to Use

1. List Available Voices

2. Get Voice Details

3. List Available Models

4. Text to Speech (Save to File)

5. Text to Speech with Streaming

6. Get User Subscription Info

Output Formats

Guidelines

最新 Skills