Claude-skill-registry Audio Mixing & Mastering

Mix voice, binaural, and SFX stems into a final master

Install

Source (clone the upstream repo):
git clone https://github.com/majiayu000/claude-skill-registry

Claude Code (install into ~/.claude/skills/):
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/audio-mixing" ~/.claude/skills/majiayu000-claude-skill-registry-audio-mixing-mastering && rm -rf "$T"

Manifest: skills/data/audio-mixing/SKILL.md

Audio Mixing Skill

The Mix Is Where Magic Happens

This skill combines voice, binaural beats, and sound effects into a cohesive, hypnotic audio experience.


Purpose

Mix multiple audio stems at correct levels and apply hypnotic post-processing for production masters.


Standard Stem Levels

CRITICAL: Always use these exact levels.

| Stem | Level | Rationale |
|------|-------|-----------|
| Voice | -6 dB | Reference level, prevents clipping |
| Binaural | -6 dB | Audible but not distracting |
| SFX | 0 dB | Full impact for transitions |
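
To make these levels concrete, the sketch below converts them to linear amplitude multipliers. It assumes the standard 20·log10 amplitude convention; the names `db_to_gain` and `STEM_LEVELS_DB` are illustrative and not part of this skill.

```python
import math

def db_to_gain(db: float) -> float:
    """Convert a level in dB to a linear amplitude multiplier (20*log10 convention)."""
    return 10 ** (db / 20)

# Stem levels copied from the Standard Stem Levels table.
STEM_LEVELS_DB = {"voice": -6.0, "binaural": -6.0, "sfx": 0.0}

gains = {name: db_to_gain(db) for name, db in STEM_LEVELS_DB.items()}

# Theoretical worst case: every stem reaches full scale at the same instant
# and the peaks add without any rescaling.
worst_case_sum = sum(gains.values())
worst_case_db = 20 * math.log10(worst_case_sum)

if __name__ == "__main__":
    for name, gain in gains.items():
        print(f"{name}: {gain:.3f}")
    print(f"worst-case summed peak: {worst_case_db:+.2f} dB relative to full scale")
```

A level of -6 dB is roughly half amplitude. Because the stems are summed, the theoretical worst case lands above full scale, which is one reason the peak check under Level Verification is worth running on every mix.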

Canonical Mix Command

ffmpeg -y \
  -i sessions/{session}/output/voice_enhanced.wav \
  -i sessions/{session}/output/binaural_dynamic.wav \
  -i sessions/{session}/output/sfx_track.wav \
  -filter_complex \
    "[0:a]volume=-6dB[voice]; \
     [1:a]volume=-6dB[bin]; \
     [2:a]volume=0dB[sfx]; \
     [voice][bin][sfx]amix=inputs=3:duration=longest:normalize=0[mixed]" \
  -map "[mixed]" \
  -acodec pcm_s16le \
  sessions/{session}/output/session_mixed.wav

Important: Use normalize=0 so that amix sums the stems without rescaling them. This prevents unpredictable level changes.


Input Stems

| Stem | File | Required |
|------|------|----------|
| Voice | voice_enhanced.wav | Yes |
| Binaural | binaural_dynamic.wav | Yes |
| SFX | sfx_track.wav | Optional |

Never use voice.mp3 - always use the enhanced version.
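
A pre-flight check along these lines can catch a missing stem before ffmpeg runs. This is a minimal sketch: the file names come from the Input Stems list, while `STEMS` and `find_stems` are hypothetical helpers, not part of the skill's scripts.

```python
from pathlib import Path

# File names from the Input Stems list; True marks a required stem.
STEMS = {
    "voice_enhanced.wav": True,
    "binaural_dynamic.wav": True,
    "sfx_track.wav": False,
}

def find_stems(output_dir: Path) -> list[Path]:
    """Return the stems present in output_dir, raising if a required one is missing."""
    missing = [
        name for name, required in STEMS.items()
        if required and not (output_dir / name).is_file()
    ]
    if missing:
        raise FileNotFoundError(f"missing required stems: {', '.join(missing)}")
    # Optional stems are included only when the file actually exists.
    return [output_dir / name for name in STEMS if (output_dir / name).is_file()]
```

The length of the returned list (2 or 3) indicates whether the three-stem or the two-stem mix applies.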


Hypnotic Post-Processing (MANDATORY)

After mixing, apply psychoacoustic mastering:

python3 scripts/core/hypnotic_post_process.py --session sessions/{session}/

Triple-Layer Hypnotic Presence

| Layer | Enhancement | Level | Effect |
|-------|-------------|-------|--------|
| 1 | Whisper Overlay | -22 dB | Ethereal presence |
| 2 | Subharmonic | -12 dB | Grounding presence |
| 3 | Double-Voice | -14 dB, 8 ms delay | Subliminal presence |

Additional Enhancements

| Enhancement | Description | Default |
|-------------|-------------|---------|
| Tape Warmth | Analog saturation | 25% drive |
| De-essing | Sibilance reduction | 4-8 kHz |
| Room Tone | Gentle reverb | 4% wet |
| Cuddle Waves | Amplitude modulation | 0.05 Hz, ±1.5 dB |
| Echo | Subtle depth | 180 ms, 25% decay |
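
As an illustration of what the Cuddle Waves defaults imply, the sketch below computes a gain envelope at 0.05 Hz with a swing of ±1.5 dB. The sinusoidal shape is an assumption; the source does not show how hypnotic_post_process.py actually implements the modulation, and `cuddle_wave_gain` is a hypothetical name.

```python
import math

def cuddle_wave_gain(t_seconds: float, rate_hz: float = 0.05, depth_db: float = 1.5) -> float:
    """Linear gain at time t for a sinusoidal level swing of +/- depth_db.

    Assumption: the modulation is a plain sine wave applied in the dB domain.
    """
    level_db = depth_db * math.sin(2 * math.pi * rate_hz * t_seconds)
    return 10 ** (level_db / 20)

if __name__ == "__main__":
    # One full cycle lasts 1 / 0.05 Hz = 20 seconds.
    for t in (0, 5, 10, 15, 20):
        print(f"t={t:>2}s gain={cuddle_wave_gain(t):.3f}")
```

At these defaults the level rises and falls once every 20 seconds, slow enough to be felt as a gentle swell rather than heard as tremolo.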

Output Files

| File | Purpose |
|------|---------|
| session_mixed.wav | Pre-master mix (intermediate) |
| {session}_MASTER.mp3 | Final deliverable (320 kbps) |
| {session}_MASTER.wav | Archive master (24-bit) |

Two-Stem Mix (No SFX)

When no SFX track is needed:

ffmpeg -y \
  -i sessions/{session}/output/voice_enhanced.wav \
  -i sessions/{session}/output/binaural_dynamic.wav \
  -filter_complex \
    "[0:a]volume=-6dB[voice]; \
     [1:a]volume=-6dB[bin]; \
     [voice][bin]amix=inputs=2:duration=longest:normalize=0[mixed]" \
  -map "[mixed]" \
  -acodec pcm_s16le \
  sessions/{session}/output/session_mixed.wav
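
The two-stem and three-stem commands differ only in the SFX input, so both can be generated from one function. The following is a sketch that mirrors the flags shown in this document; `build_mix_command` and the `sessions/demo` path in the usage note are hypothetical.

```python
from pathlib import Path

def build_mix_command(session_dir: str, include_sfx: bool = True) -> list[str]:
    """Build the ffmpeg argument list for the two- or three-stem mix."""
    out = Path(session_dir) / "output"
    # (file name, filter label, level) for each stem, in input order.
    stems = [
        ("voice_enhanced.wav", "voice", "-6dB"),
        ("binaural_dynamic.wav", "bin", "-6dB"),
    ]
    if include_sfx:
        stems.append(("sfx_track.wav", "sfx", "0dB"))

    cmd = ["ffmpeg", "-y"]
    for name, _, _ in stems:
        cmd += ["-i", str(out / name)]

    # One volume filter per input, then a single amix over all labels.
    chains = [
        f"[{i}:a]volume={level}[{label}]"
        for i, (_, label, level) in enumerate(stems)
    ]
    labels = "".join(f"[{label}]" for _, label, _ in stems)
    chains.append(
        f"{labels}amix=inputs={len(stems)}:duration=longest:normalize=0[mixed]"
    )

    cmd += [
        "-filter_complex", ";".join(chains),
        "-map", "[mixed]",
        "-acodec", "pcm_s16le",
        str(out / "session_mixed.wav"),
    ]
    return cmd
```

Passing the result to `subprocess.run(build_mix_command("sessions/demo"), check=True)` would run the mix. Using an argument list avoids shell quoting issues around the filter graph.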

Level Verification

After mixing, check levels:

# Check peak level (should be < 0 dB)
ffmpeg -i session_mixed.wav -af "volumedetect" -f null /dev/null 2>&1 | grep max_volume

# Check LUFS (target: -14 LUFS)
ffmpeg -i session_mixed.wav -af "loudnorm=print_format=json" -f null /dev/null 2>&1

Target Levels

| Metric | Target | Acceptable Range |
|--------|--------|------------------|
| Integrated LUFS | -14 LUFS | -16 to -12 LUFS |
| True Peak | -1.5 dBTP | < -1.0 dBTP |
| Peak | -3 dB | < 0 dB |
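
The two verification commands print their results to stderr, so the acceptable ranges above can be checked automatically. The sketch below parses that text. It assumes volumedetect logs a `max_volume: <value> dB` line and that loudnorm's JSON reports the measured values under `input_i` and `input_tp`; the exact log layout may differ between ffmpeg versions, and the function names are hypothetical.

```python
import json
import re

def parse_max_volume(stderr_text: str) -> float:
    """Extract max_volume (in dB) from volumedetect log output."""
    match = re.search(r"max_volume:\s*(-?\d+(?:\.\d+)?) dB", stderr_text)
    if match is None:
        raise ValueError("max_volume not found in ffmpeg output")
    return float(match.group(1))

def parse_loudnorm(stderr_text: str) -> dict:
    """Extract the flat JSON object that loudnorm prints at the end of its log."""
    start = stderr_text.rindex("{")
    end = stderr_text.rindex("}") + 1
    return json.loads(stderr_text[start:end])

def check_levels(max_volume_db: float, loudnorm: dict) -> list[str]:
    """Return a list of problems; an empty list means all ranges are met."""
    problems = []
    lufs = float(loudnorm["input_i"])       # measured integrated loudness
    true_peak = float(loudnorm["input_tp"])  # measured true peak
    if not -16.0 <= lufs <= -12.0:
        problems.append(f"integrated loudness {lufs} LUFS outside -16 to -12")
    if not true_peak < -1.0:
        problems.append(f"true peak {true_peak} dBTP is not below -1.0")
    if not max_volume_db < 0.0:
        problems.append(f"peak {max_volume_db} dB is not below 0")
    return problems
```

The stderr text can be captured with `subprocess.run(..., capture_output=True, text=True).stderr` and passed to the parsers.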

Binaural Beat Standards

| Brainwave State | Frequency Range | Use For |
|-----------------|-----------------|---------|
| Beta | 13-30 Hz | Alert, focused |
| Alpha | 8-12 Hz | Relaxed, light trance |
| Theta | 4-7 Hz | Deep trance, meditation |
| Delta | 0.5-3 Hz | Very deep, sleep-adjacent |

Typical Journey Curve

0:00  - Alpha (10 Hz) - Induction
5:00  - Theta (7 Hz)  - Deepening
15:00 - Deep Theta (4 Hz) - Journey core
25:00 - Theta (7 Hz)  - Integration
28:00 - Alpha (10 Hz) - Emergence
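
The journey curve lists target frequencies at fixed timestamps but does not say how the beat moves between them. The sketch below assumes straight-line ramps between keyframes, which is one plausible reading rather than the documented behaviour of the binaural generator; `JOURNEY` and `beat_frequency` are hypothetical names.

```python
from bisect import bisect_right

# (time in seconds, beat frequency in Hz), transcribed from the journey curve.
JOURNEY = [(0, 10.0), (300, 7.0), (900, 4.0), (1500, 7.0), (1680, 10.0)]

def beat_frequency(t_seconds: float) -> float:
    """Beat frequency at time t, linearly interpolated between keyframes.

    Assumption: transitions are linear ramps; times outside the curve
    hold the first or last keyframe value.
    """
    times = [t for t, _ in JOURNEY]
    if t_seconds <= times[0]:
        return JOURNEY[0][1]
    if t_seconds >= times[-1]:
        return JOURNEY[-1][1]
    i = bisect_right(times, t_seconds)
    (t0, f0), (t1, f1) = JOURNEY[i - 1], JOURNEY[i]
    return f0 + (f1 - f0) * (t_seconds - t0) / (t1 - t0)

if __name__ == "__main__":
    for minute in (0, 5, 10, 15, 25, 28):
        print(f"{minute:>2}:00 -> {beat_frequency(minute * 60):.2f} Hz")
```

Under this assumption the beat at 10:00 sits halfway between the 5:00 and 15:00 keyframes.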

Troubleshooting

| Issue | Cause | Solution |
|-------|-------|----------|
| Clipping/distortion | Levels too hot | Check stem levels, use -6 dB |
| Binaural inaudible | Level too low | Should be -6 dB (not -12 dB) |
| Voice buried | Binaural too loud | Verify -6 dB on binaural |
| Muddy mix | Sample rate mismatch | All stems same rate (44100 or 48000 Hz) |
| SFX too harsh | Level too high | Try -3 dB for gentler SFX |
| Silent output | normalize=1 issue | Use normalize=0 in amix |
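
The sample rate mismatch case can be detected before mixing. This sketch uses Python's standard `wave` module, which reads PCM WAV files; stems in other encodings would need a different reader. `wav_info` and `check_sample_rates` are hypothetical helpers.

```python
import wave

def wav_info(path: str) -> tuple[int, float]:
    """Return (sample rate in Hz, duration in seconds) for a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        return rate, wav.getnframes() / rate

def check_sample_rates(paths: list[str]) -> int:
    """Return the common sample rate, raising ValueError if the stems disagree."""
    if not paths:
        raise ValueError("no stems given")
    rates = {path: wav_info(path)[0] for path in paths}
    if len(set(rates.values())) > 1:
        raise ValueError(f"sample rate mismatch: {rates}")
    return next(iter(rates.values()))
```

The duration returned by `wav_info` can also be compared against the expected session length when checking the final mix.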

Quality Checklist

Before video assembly:

  • {session}_MASTER.mp3 exists
  • No clipping or distortion
  • Binaural beats audible but not intrusive
  • Voice clear and present
  • SFX properly timed (if used)
  • LUFS within target range
  • Duration matches expected

Integration with Pipeline

Before (dependencies):

  • Voice synthesis complete (voice_enhanced.wav)
  • Binaural generated (binaural_dynamic.wav)
  • SFX track if needed (sfx_track.wav)

After (next steps):

  • Video assembly
  • YouTube packaging

Related Resources

  • Skill: tier3-production/voice-synthesis/ (input)
  • Skill: tier3-production/video-assembly/ (next step)
  • Serena Memory: audio_production_methodology
  • Script: scripts/core/hypnotic_post_process.py