视频翻译配音工具 | 自动翻译视频语音并生成多语言配音 | AI视频本地化

video-translation by noizai/skills

1,300 周安装量

402 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/noizai/skills --skill video-translation

AI/机器学习音频处理

🇨🇳中文介绍

视频翻译

将视频中的语音翻译成另一种语言，使用 TTS 生成配音音频并替换原始音轨。

触发器

translate this video
dub this video to English
把视频从 X 语译成 Y 语
视频翻译

使用场景

用户希望观看外语 YouTube 视频，但更愿意听到母语配音。
用户提供视频链接并明确要求更改音频语言。

工作流程

当用户请求翻译视频时：

下载视频和字幕：使用 youtube-downloader 技能下载视频及其 SRT 格式的字幕。确保指定源语言以获取正确的字幕。

python path/to/youtube-downloader/scripts/download_video.py "VIDEO_URL" --subtitles --sub-lang <source_lang_code> -o /tmp/video-translation
翻译字幕：读取下载的 .srt 文件。使用以下固定提示词，将其内容逐句翻译成目标语言。保持完全相同的 SRT 索引和时间戳格式！

翻译提示词：

Translate the following subtitle text from to . Provide ONLY the translated text. Do not explain, do not add notes, do not add index numbers. The translation must be colloquial, natural-sounding, and suitable for video dubbing.

将翻译后的文本保存到新文件 translated.srt。

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

相关 Skills

find-skills 技能搜索工具 - Vercel Labs 开源智能体技能包管理器

705,000 周安装

React 组合模式指南：Vercel 组件架构最佳实践，提升代码可维护性

99,500 周安装

AI 代码实施计划编写技能 | 自动化开发任务分解与 TDD 流程规划工具

37,100 周安装

超能力技能使用指南：AI助手技能调用优先级与工作流程详解

33,600 周安装

executing-plans 技能：AI 辅助代码执行与计划实施工具 | Superpowers 开发助手

生成配音音频：使用 tts 技能根据翻译后的 SRT 文件渲染时间轴精确的音频。Noiz 后端会自动将每个句子的时长与原视频字幕时间戳对齐。

为确保克隆的语音与原始说话者在每个句子中的确切语调和情感相匹配，请将原始视频文件传递给 --ref-audio-track。TTS 引擎将自动在每个字幕的精确时间戳处切片原始音频，并将其用作该特定片段的参考。

创建一个基本的 voice_map.json：

    {
  "default": {
    "target_lang": "<target_lang_code>"
  }
}

渲染时间轴精确的音频：

    bash skills/tts/scripts/tts.sh render --srt translated.srt --voice-map voice_map.json --backend noiz --auto-emotion --ref-audio-track original_video.mp4 -o dubbed.wav

4. 替换视频中的音频：使用 replace_audio.sh 脚本将原始视频与新的配音音频合并。为了保留原始视频在翻译片段之外的非语音背景音，请传递 --srt 文件。

    bash skills/video-translation/scripts/replace_audio.sh --video original_video.mp4 --audio dubbed.wav --output final_video.mp4 --srt translated.srt

5. 呈现结果：将 final_video.mp4 文件路径返回给用户。

必需输入：
- VIDEO_URL：要翻译的视频的 URL。
- target_language：要将音频翻译成的语言。
可选输入：
- source_language：原始视频的语言（如果未自动检测或指定）。
- reference_audio：用于语音克隆的特定音频文件/URL，而不是动态的原始视频音轨。

成功：替换音频后的最终视频文件路径。
失败：明确指出是下载、TTS 还是音频替换失败的清晰错误信息。

依赖项（其他技能）
- youtube-downloader (crazynomad/skills) — SKILL.md
  安装：从 crazynomad/skills 克隆或复制 skills/youtube-downloader 目录到你的 skills/ 文件夹，确保 skills/youtube-downloader/scripts/download_video.py 可用。
- tts (NoizAI/skills) — SKILL.md
  如果此仓库中尚未存在：从 NoizAI/skills 克隆或复制 skills/tts 目录到你的 skills/ 文件夹。确保 skills/tts/scripts/tts.sh 及相关脚本存在。
为 Noiz 后端配置 NOIZ_API_KEY。如果未设置，首先引导用户从 https://developers.noiz.ai/api-keys 获取 API 密钥。用户提供密钥后，询问他们是否希望持久保存；如果他们同意，可以在项目的 .env 文件中写入/更新 NOIZ_API_KEY=...，或者运行 bash skills/tts/scripts/tts.sh config --set-api-key YOUR_KEY 来存储它。
已安装 ffmpeg。

源视频必须在平台上提供源语言的字幕（或自动生成的字幕）。
非常长的视频可能需要大量时间来翻译和配音。

🇺🇸English

Video Translation

Translate a video's speech into another language, using TTS to generate the dubbed audio and replacing the original audio track.

Triggers

translate this video
dub this video to English
把视频从 X 语译成 Y 语
视频翻译

Use Cases

The user wants to watch a foreign language YouTube video but prefers to hear it in their native language.
The user provides a video link and explicitly requests changing the audio language.

Workflow

When the user asks to translate a video:

Download Video & Subtitles: Use the youtube-downloader skill to download the video and its subtitles as SRT. Make sure you specify the source language to fetch the correct subtitle.

python path/to/youtube-downloader/scripts/download_video.py "VIDEO_URL" --subtitles --sub-lang <source_lang_code> -o /tmp/video-translation
Translate Subtitles : Read the downloaded .srt file. Translate its contents sentence by sentence into the target language using the following fixed prompt. Keep the exact same SRT index and timestamp format!

Translation Prompt :

Translate the following subtitle text from to . Provide ONLY the translated text. Do not explain, do not add notes, do not add index numbers. The translation must be colloquial, natural-sounding, and suitable for video dubbing.

Save the translated text into a new file translated.srt.

Generate Dubbed Audio : Use the tts skill to render the timeline-accurate audio from the translated SRT. The Noiz backend automatically aligns the duration of each sentence to the original video's subtitle timestamps.

To ensure the cloned voice matches the original speaker's exact tone and emotion for each sentence, pass the original video file to --ref-audio-track. The TTS engine will automatically slice the original audio at each subtitle's exact timestamp and use it as the reference for that specific segment.

Create a basic voice_map.json:

    {
  "default": {
    "target_lang": "<target_lang_code>"
  }
}

Render the timeline-accurate audio:

    bash skills/tts/scripts/tts.sh render --srt translated.srt --voice-map voice_map.json --backend noiz --auto-emotion --ref-audio-track original_video.mp4 -o dubbed.wav

4. Replace Audio in Video : Use the replace_audio.sh script to merge the original video with the new dubbed audio. To keep the original video's non-speech audio background outside of translated segments, pass the --srt file.

    bash skills/video-translation/scripts/replace_audio.sh --video original_video.mp4 --audio dubbed.wav --output final_video.mp4 --srt translated.srt

5. Present the Result : Return the final_video.mp4 file path to the user.

Inputs

Required inputs :
- VIDEO_URL: The URL of the video to translate.
- target_language: The language to translate the audio to.
Optional inputs :
- source_language: The language of the original video (if not auto-detected or specified).
- reference_audio: Specific audio file/URL to use for voice cloning instead of the dynamic original video track.

Outputs

Success: Path to the final video file with replaced audio.
Failure: Clear error message specifying whether download, TTS, or audio replacement failed.

Requirements

Dependencies (other skills)
- youtube-downloader (crazynomad/skills) — SKILL.md
  Install: clone or copy the skills/youtube-downloader directory from crazynomad/skills into your skills/ folder so that skills/youtube-downloader/scripts/download_video.py is available.
- tts (NoizAI/skills) — SKILL.md
  If not already in this repo: clone or copy the skills/tts directory from NoizAI/skills into your skills/ folder. Ensure skills/tts/scripts/tts.sh and related scripts are present.

Limitations

The source video must have subtitles (or auto-generated subtitles) available on the platform for the source language.
Very long videos may take a significant amount of time to translate and dub.

Weekly Installs

1.3K

Repository

noizai/skills

GitHub Stars

402

First Seen

Mar 2, 2026

Security Audits

Gen Agent Trust HubWarn SocketPass SnykWarn

Installed on

opencode1.3K

gemini-cli1.3K

kimi-cli1.3K

cline1.3K

cursor1.3K

github-copilot1.3K

NOIZ_API_KEY configured for the Noiz backend. If it is not set, first guide the user to get an API key from https://developers.noiz.ai/api-keys. After the user provides the key, ask whether they want to persist it; if they agree, either write/update NOIZ_API_KEY=... in the project's .env file or run bash skills/tts/scripts/tts.sh config --set-api-key YOUR_KEY to store it.