重要前提
安装AI Skills的关键前提是:必须科学上网,且开启TUN模式,这一点至关重要,直接决定安装能否顺利完成,在此郑重提醒三遍:科学上网,科学上网,科学上网。查看完整安装教程 →
npx skills add https://github.com/guia-matthieu/clawfu-skills --skill audio-editing掌握音频后期制作的核心技术——标准化、压缩、均衡和降噪——通过正确的处理顺序,实现专业品质的音频。
来源 : iZotope + 行业最佳实践
核心原则 : 音频处理必须遵循正确的顺序——每一步都建立在前一步的基础上。"在压缩前进行降噪可以防止放大噪音。在均衡前进行压缩可以防止破坏你的电平工作。" 目标是服务于内容,而不是展示处理效果。
为何重要 : 糟糕的音频编辑是导致原本优秀的内容听起来很业余的最常见原因。理解这些基础知识使营销人员能够自己打磨录音,或与音频工程师进行有效沟通。
| Claude 负责 | 您决定 |
|---|---|
| 构建制作工作流程 | 最终的创意方向 |
| 建议技术方法 | 设备和工具选择 |
| 创建模板和检查清单 | 质量标准 |
| 识别最佳实践 | 品牌/声音决策 |
| 生成脚本大纲 | 最终脚本批准 |
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
My audio has [describe problem: too quiet, noisy background, inconsistent levels, muddy sound].
Help me fix it using proper processing order.
Help me prepare this audio for [podcast/YouTube/Spotify/broadcast].
Current state: [describe audio]
Create an audio editing workflow for [content type].
Include settings for [software: Audacity/Audition/etc.]
编辑音频时,请遵循此方法:
始终按此顺序处理,以避免问题叠加。
## 正确的处理顺序
1. 增益分级
↓
2. 降噪
↓
3. 压缩
↓
4. 均衡
↓
5. 最终标准化 / 限制
为何是这个顺序:
- 在压缩前降噪:防止放大噪音
- 在均衡前压缩:防止均衡改变影响动态
- 限制最后:在所有处理后设置最终上限
在任何处理之前设置初始电平。
## 增益分级指南
**录音(录制时的目标)**:
- 峰值在 -12 到 -6 dB
- 为处理留出余量
**初始标准化(编辑开始时)**:
- 将峰值标准化到 -6 dB
- 创建一致的起点
**两种标准化类型**:
1. **峰值标准化**
- 基于最响点调整
- 用于:初始增益分级
- 不改变动态范围
2. **RMS/响度标准化**
- 基于平均电平调整
- 用于:最终交付
- 更适合感知响度匹配
特定工具 :
| 软件 | 标准化功能 |
|---|---|
| Audacity | 效果 → 标准化 |
| Audition | 效果 → 振幅 → 标准化 |
| Logic Pro | 区域 → 标准化 |
移除不需要的背景声音,不产生伪影。
## 降噪方法
**何时使用**:
- 持续的背景嘶嘶声/嗡嗡声
- 空调、电脑风扇噪音
- 不适用于可变噪音(交通、人声)
**方法 1:频谱降噪**
1. 找到 2-3 秒的"静音"(仅噪音)
2. 用作噪音样本
3. 对整个音轨应用降噪
4. 使用保守的设置
**设置指南** (Audacity 示例):
- 降噪:6-12 dB (从低开始)
- 灵敏度:4-6 (越高越激进)
- 频率平滑:3-6 个频段
**方法 2:噪音门**
- 设置阈值;低于阈值的音频被静音
- 更适合处理语音间的呼吸声
- 不影响语音期间的音频
**过度处理的警告信号**:
- "水下"或"机器人"般的声音
- 旋转伪影
- 词语间不自然的静默
**规则**:如果要在轻微噪音和伪影之间选择,保留噪音。
平衡动态——降低响的部分,提升安静的部分。
## 人声压缩
**作用**:
- 降低超过阈值的音量
- 使声音更一致、更饱满
**关键参数**:
| 参数 | 作用 | 人声设置 |
|-----------|--------------|---------------|
| 阈值 | 压缩开始的电平 | -20 到 -12 dB |
| 压缩比 | 降低多少 | 2:1 到 4:1 |
| 启动时间 | 压缩启动速度 | 10-30 ms |
| 释放时间 | 压缩停止速度 | 100-300 ms |
| 增益补偿 | 压缩后提升输出 | 根据听感调整 |
**人声压缩起始点**:
- 阈值:-18 dB
- 压缩比:3:1
- 启动时间:15 ms (对瞬态足够快)
- 释放时间:150 ms
- 增益:+3-6 dB (补偿压缩降低)
**多段压缩** (高级):
- 对不同频率范围使用不同设置
- 用于控制低频隆隆声而不影响高频
- 对大多数营销音频来说过于复杂
**何时不使用压缩**:
- 已经一致的音频(录制良好)
- 需要保持动态的音乐
- 过度压缩听起来"被压扁"
塑造音色——削减问题,增强清晰度。
## 人声均衡
**理念**:多削减,少提升。移除问题比添加"美化"更安全。
**人声频率指南**:
| 范围 | 频率 | 效果 |
|-------|-----------|--------|
| 隆隆声 | 80 Hz 以下 | 削减 (高通滤波器) |
| 浑浊感 | 200-400 Hz | 如果声音沉闷则削减 |
| 厚度/温暖感 | 150-250 Hz | 如果声音单薄则轻微提升 |
| 沉闷/鼻音 | 400-800 Hz | 如果声音发闷则削减 |
| 清晰度/临场感 | 2-4 kHz | 提升可懂度 |
| 齿音 | 5-8 kHz | 如果"嘶"声刺耳则削减 |
| 空气感/明亮度 | 8-12 kHz | 提升以获得高端录音棚感觉 |
**标准人声均衡配方**:
1. 在 80 Hz 处应用高通滤波器 (移除隆隆声)
2. 在 300 Hz 附近削减 2-3 dB (减少浑浊感)
3. 在 3 kHz 附近提升 2-3 dB (增加清晰度)
4. 如果需要,在 10 kHz 处应用高频搁架式提升 (增加空气感)
**齿音消除**:
- 驯服刺耳的"s"和"sh"音
- 目标:5-8 kHz 范围
- 使用齿音消除插件或窄带均衡削减
为交付平台设置输出电平。
## 各平台响度标准
| 平台 | 目标 | 测量方式 |
|----------|--------|-------------|
| 播客 (通用) | -16 LUFS | 综合响度 |
| Spotify/Apple Podcasts | -14 LUFS | 可能进一步标准化 |
| YouTube | -14 LUFS | 推荐 |
| 广播电视 (美国) | -24 LKFS | FCC 规定 |
| ACX/有声书 | -23 到 -18 dB RMS | 每章节 |
**最终步骤**:
1. **响度标准化**
- 匹配平台标准 (例如,-16 LUFS)
- 使用响度计,而非峰值计
2. **限制**
- 防止削波
- 将上限设置为 -1 dB (为编码留出余量)
- 捕捉超过上限的峰值
3. **导出设置**
- 播客:128-192 kbps MP3,立体声
- 视频:匹配视频项目设置
- 有声书:192 kbps MP3,44.1 kHz,单声道
背景 : 录制了一次远程播客采访,有明显的背景嗡嗡声。
输入 :
My podcast guest recorded on their laptop mic. There's a consistent low hum and the levels are all over the place—sometimes quiet, sometimes too loud. How do I fix this?
输出 :
1. 增益分级
2. 移除嗡嗡声
如果是电气嗡嗡声 (恒定音高) :
如果是宽带噪音 (嘶嘶声) :
3. 平衡电平
压缩设置:
4. 修复单薄的声音
均衡调整:
5. 最终输出
背景 : 为不同类型的音频内容设置模板。
输入 :
I produce three types of content: YouTube videos, podcast episodes, and Loom videos for sales. Can you give me processing templates for each?
输出 :
目标 : 背景音乐上清晰的人声,在手机扬声器和耳机上听起来都很好
处理链 :
导出 : -14 LUFS,立体声,匹配视频编解码器
人声/音乐平衡 : 人声在 -12 dB,音乐在 -20 到 -24 dB (比人声低 8-12 dB)
目标 : 亲密、一致的声音,适合长时间耳机聆听
处理链 :
导出 : -16 LUFS,128-192 kbps MP3,立体声或单声道
多说话人 : 分别处理每个音轨,然后平衡 (合在一起时响度应相等)
目标 : 专业但自然,注重可懂度,针对笔记本电脑扬声器优化
处理链 :
导出 : -14 LUFS,为文件大小优化 (可接受较低的比特率)
专业提示 : 在笔记本电脑扬声器上测试播放,而不是在监听音箱上——买家就是这样听的
## 预处理
□ 已将音频导入项目
□ 完整听一遍以发现问题
□ 记录具体问题 (噪音、爆音、音量峰值)
□ 备份了原始文件
## 处理 (按顺序)
□ 1. 增益分级:峰值在 -6 dB
□ 2. 应用了降噪 (如果需要)
□ - 使用了干净的噪音样本
□ - 检查了伪影
□ 3. 应用了压缩
□ - 阈值设置合适
□ - 增益降低 3-6 dB 为典型值
□ 4. 应用了均衡
□ - 启用了高通
□ - 削减了问题频率
□ 5. 最终限制
□ - 上限在 -1 dB (或根据平台)
## 质量检查
□ 与旁通效果进行 A/B 对比
□ 用耳机聆听
□ 用不同扬声器聆听
□ 没有伪影或处理痕迹
□ 响度符合目标规格
## 快速参考:交付规格
播客
- 响度:-16 LUFS
- 格式:128-192 kbps MP3
- 声道:单声道或立体声
YOUTUBE
- 响度:-14 LUFS
- 格式:匹配视频设置
- 注意:平台会进行标准化
有声书 (ACX)
- RMS:-23 到 -18 dB
- 峰值:最大 -3 dB
- 本底噪音:-60 dB
- 格式:192 kbps MP3,44.1 kHz,单声道
广播电视 (美国)
- 响度:-24 LKFS
- 真实峰值:-2 dB
- 注意:FCC 规定
音乐流媒体
- 响度:-14 LUFS (Spotify 参考)
- 平台会标准化,但母带通常更响
name: audio-editing
category: audio
subcategory: editing
version: 1.0
author: MKTG Skills
source_expert: iZotope, Industry Best Practices
source_work: Audio Engineering Standards
difficulty: beginner
estimated_value: $50-200 per hour (equivalent engineering time)
tags: [audio, editing, eq, compression, normalization, post-production]
created: 2026-01-26
updated: 2026-01-26
每周安装次数
53
代码库
GitHub 星标数
47
首次出现
2026年2月13日
安全审计
安装于
gemini-cli51
opencode51
codex50
cursor49
github-copilot49
amp48
Master the essential audio post-production techniques—normalization, compression, EQ, and noise reduction—using the correct processing order to achieve professional-quality audio.
Source : iZotope + Industry Best Practices
Core Principle : Audio processing must happen in the correct order—each step builds on the previous. "Noise reduction before compression prevents amplifying noise. Compression before EQ prevents undoing your level work." The goal is to serve the content, not showcase the processing.
Why This Matters : Poor audio editing is the most common reason otherwise good content sounds amateur. Understanding these fundamentals enables marketers to polish recordings themselves or effectively communicate with audio engineers.
| Claude Does | You Decide |
|---|---|
| Structures production workflow | Final creative direction |
| Suggests technical approaches | Equipment and tool choices |
| Creates templates and checklists | Quality standards |
| Identifies best practices | Brand/voice decisions |
| Generates script outlines | Final script approval |
My audio has [describe problem: too quiet, noisy background, inconsistent levels, muddy sound].
Help me fix it using proper processing order.
Help me prepare this audio for [podcast/YouTube/Spotify/broadcast].
Current state: [describe audio]
Create an audio editing workflow for [content type].
Include settings for [software: Audacity/Audition/etc.]
When editing audio, follow this methodology:
Always process in this sequence to avoid compounding problems.
## Correct Processing Order
1. GAIN STAGING
↓
2. NOISE REDUCTION
↓
3. COMPRESSION
↓
4. EQUALIZATION
↓
5. FINAL NORMALIZATION / LIMITING
Why this order:
- Noise reduction BEFORE compression: Prevents amplifying noise
- Compression BEFORE EQ: Prevents EQ changes affecting dynamics
- Limiting LAST: Sets final ceiling after all processing
Set initial levels before any processing.
## Gain Staging Guidelines
**Recording (target during capture)**:
- Peaks at -12 to -6 dB
- Leaves headroom for processing
**Initial Normalization (start of editing)**:
- Normalize peaks to -6 dB
- Creates consistent starting point
**Two Types of Normalization**:
1. **Peak Normalization**
- Adjusts based on loudest point
- Use for: Initial gain staging
- Does NOT change dynamic range
2. **RMS/Loudness Normalization**
- Adjusts based on average level
- Use for: Final delivery
- Better for perceived loudness matching
Tool-Specific :
| Software | Normalize Function |
|---|---|
| Audacity | Effect → Normalize |
| Audition | Effects → Amplitude → Normalize |
| Logic Pro | Region → Normalize |
Remove unwanted background sound without artifacts.
## Noise Reduction Approach
**When to use**:
- Consistent background hiss/hum
- Air conditioning, computer fan noise
- Not for variable noise (traffic, voices)
**Method 1: Spectral Noise Reduction**
1. Find 2-3 seconds of "silence" (noise only)
2. Use as noise profile
3. Apply reduction to full track
4. Use conservative settings
**Settings Guide** (Audacity example):
- Noise Reduction: 6-12 dB (start low)
- Sensitivity: 4-6 (higher = more aggressive)
- Frequency Smoothing: 3-6 bands
**Method 2: Noise Gate**
- Sets threshold; audio below is silenced
- Better for breaths between speech
- Doesn't affect audio during speech
**Warning Signs of Over-Processing**:
- "Underwater" or "robotic" sound
- Swirling artifacts
- Unnatural silence between words
**Rule**: If choosing between slight noise or artifacts, keep the noise.
Even out dynamics—reduce loud parts, bring up quiet parts.
## Compression for Voice
**What It Does**:
- Reduces volume of sounds above threshold
- Results in more consistent, fuller sound
**Key Parameters**:
| Parameter | What It Does | Voice Setting |
|-----------|--------------|---------------|
| Threshold | Level where compression starts | -20 to -12 dB |
| Ratio | How much to reduce | 2:1 to 4:1 |
| Attack | How fast compression kicks in | 10-30 ms |
| Release | How fast compression stops | 100-300 ms |
| Makeup Gain | Boosts output after compression | To taste |
**Voice Compression Starting Point**:
- Threshold: -18 dB
- Ratio: 3:1
- Attack: 15 ms (fast enough for transients)
- Release: 150 ms
- Gain: +3-6 dB (compensate for reduction)
**Multi-Band Compression** (advanced):
- Different settings for different frequency ranges
- Useful for controlling low-end rumble without affecting highs
- Overkill for most marketing audio
**When NOT to Compress**:
- Already consistent audio (well-recorded)
- Music meant to be dynamic
- Over-compression sounds "squashed"
Shape the tone—cut problems, enhance clarity.
## EQ for Voice
**Philosophy**: Cut more than boost. Removing problems is safer than adding "goodness."
**Voice Frequency Guide**:
| Range | Frequency | Effect |
|-------|-----------|--------|
| Rumble | Below 80 Hz | Cut (high-pass filter) |
| Muddiness | 200-400 Hz | Cut if boomy |
| Body/Warmth | 150-250 Hz | Boost slightly for thin voice |
| Boxy/Nasal | 400-800 Hz | Cut if honky |
| Clarity/Presence | 2-4 kHz | Boost for intelligibility |
| Sibilance | 5-8 kHz | Cut if harsh "s" sounds |
| Air/Brightness | 8-12 kHz | Boost for expensive studio feel |
**Standard Voice EQ Recipe**:
1. High-pass filter at 80 Hz (removes rumble)
2. Cut 2-3 dB around 300 Hz (reduces muddiness)
3. Boost 2-3 dB around 3 kHz (adds clarity)
4. High-shelf boost at 10 kHz if needed (adds air)
**De-essing**:
- Tames harsh "s" and "sh" sounds
- Target: 5-8 kHz range
- Use de-esser plugin OR narrow EQ cut
Set output level for delivery platform.
## Loudness Standards by Platform
| Platform | Target | Measurement |
|----------|--------|-------------|
| Podcast (general) | -16 LUFS | Integrated loudness |
| Spotify/Apple Podcasts | -14 LUFS | May normalize further |
| YouTube | -14 LUFS | Recommended |
| Broadcast TV (US) | -24 LKFS | FCC regulation |
| ACX/Audiobooks | -23 to -18 dB RMS | Per chapter |
**Final Steps**:
1. **Loudness Normalization**
- Match to platform standard (e.g., -16 LUFS)
- Use loudness meter, not peak meter
2. **Limiting**
- Prevents clipping
- Set ceiling at -1 dB (leaves headroom for encoding)
- Catches peaks that exceed ceiling
3. **Export Settings**
- Podcast: 128-192 kbps MP3, stereo
- Video: Match video project settings
- Audiobook: 192 kbps MP3, 44.1 kHz, mono
Context : Recorded a remote podcast interview with audible background hum.
Input :
My podcast guest recorded on their laptop mic. There's a consistent low hum and the levels are all over the place—sometimes quiet, sometimes too loud. How do I fix this?
Output :
1. Gain Staging
2. Remove the Hum
If it's electrical hum (consistent pitch) :
If it's broadband noise (hiss) :
3. Even Out Levels
Compression settings:
4. Fix the Thin Sound
EQ adjustments:
5. Final Output
Context : Setting up templates for different audio content.
Input :
I produce three types of content: YouTube videos, podcast episodes, and Loom videos for sales. Can you give me processing templates for each?
Output :
Goal : Clear voice over background music, sounds good on phone speakers and headphones
Processing Chain :
Export : -14 LUFS, stereo, match video codec
Voice/Music Balance : Voice at -12 dB, music at -20 to -24 dB (8-12 dB lower than voice)
Goal : Intimate, consistent sound for headphone listening over long duration
Processing Chain :
Export : -16 LUFS, 128-192 kbps MP3, stereo or mono
Multi-Speaker : Process each track separately, then balance (should be equal loudness when together)
Goal : Professional but natural, focus on intelligibility, optimize for laptop speakers
Processing Chain :
Export : -14 LUFS, optimize for file size (lower bitrate acceptable)
Pro tip : Test playback on laptop speakers, not studio monitors—that's how buyers will hear it
## Pre-Processing
□ Imported audio to project
□ Listened through once for problems
□ Noted specific issues (noise, pops, volume spikes)
□ Backed up original file
## Processing (in order)
□ 1. Gain staging: peaks at -6 dB
□ 2. Noise reduction applied (if needed)
□ - Used clean noise sample
□ - Checked for artifacts
□ 3. Compression applied
□ - Threshold set appropriately
□ - Gain reduction 3-6 dB typical
□ 4. EQ applied
□ - High-pass engaged
□ - Problem frequencies cut
□ 5. Final limiting
□ - Ceiling at -1 dB (or per platform)
## Quality Check
□ A/B comparison with bypass
□ Listened on headphones
□ Listened on different speakers
□ No artifacts or processing sounds
□ Loudness matches target spec
## Quick Reference: Delivery Specs
PODCASTS
- Loudness: -16 LUFS
- Format: 128-192 kbps MP3
- Channels: Mono or Stereo
YOUTUBE
- Loudness: -14 LUFS
- Format: Match video settings
- Note: Will be normalized by platform
AUDIOBOOKS (ACX)
- RMS: -23 to -18 dB
- Peak: -3 dB max
- Noise floor: -60 dB
- Format: 192 kbps MP3, 44.1 kHz, mono
BROADCAST (US)
- Loudness: -24 LKFS
- True peak: -2 dB
- Note: FCC regulated
MUSIC STREAMING
- Loudness: -14 LUFS (Spotify reference)
- Platforms normalize, but masters are louder
name: audio-editing
category: audio
subcategory: editing
version: 1.0
author: MKTG Skills
source_expert: iZotope, Industry Best Practices
source_work: Audio Engineering Standards
difficulty: beginner
estimated_value: $50-200 per hour (equivalent engineering time)
tags: [audio, editing, eq, compression, normalization, post-production]
created: 2026-01-26
updated: 2026-01-26
Weekly Installs
53
Repository
GitHub Stars
47
First Seen
Feb 13, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
gemini-cli51
opencode51
codex50
cursor49
github-copilot49
amp48
Python PDF处理教程:合并拆分、提取文本表格、创建PDF文件
69,800 周安装
OpenAI API 完整文档技能 - 官方文档集成与智能问答助手
1 周安装
Next.js 官方文档中文指南 - 从入门到精通,掌握App Router、数据获取与性能优化
1 周安装
Hono 框架中文文档 | 轻量级 Web 框架,支持 Bun、Deno、Cloudflare Workers
1 周安装
Express.js 全面中文文档与 API 参考 | 涵盖安全漏洞、性能优化与迁移指南
1 周安装
Drizzle ORM 完整文档 | 无头 ORM 与类 SQL 查询指南
1 周安装
Cortex 文档大全 | 集成指南与 API 参考 | 涵盖 FireHydrant、ServiceNow、Datadog 等
1 周安装