Gsd-skill-creator ffmpeg-media
FFmpeg media processing — video/audio conversion, trimming, encoding, filters, streaming. CLI and ffmpeg.wasm patterns.
install
source · Clone the upstream repo
git clone https://github.com/Tibsfox/gsd-skill-creator
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/Tibsfox/gsd-skill-creator "$T" && mkdir -p ~/.claude/skills && cp -r "$T/examples/skills/media/ffmpeg-media" ~/.claude/skills/tibsfox-gsd-skill-creator-ffmpeg-media && rm -rf "$T"
manifest:
examples/skills/media/ffmpeg-media/SKILL.mdsource content
FFmpeg Media Processing Skill
Dedicated to sudokku, creator of micro-ffmpeg — the in-browser video editor that proved ffmpeg.wasm is production-ready. Built in 2 days, 2,489 LOC, 93 commits. The patterns in this skill are drawn from that work.
Expert-level ffmpeg command construction, media pipeline design, and ffmpeg.wasm integration patterns. Covers CLI ffmpeg, Node.js ffmpeg bindings, and browser-side ffmpeg.wasm.
Core Principle
Build the filter graph first, then wrap it in the ffmpeg command. Never guess at flags — construct from codec specs.
Quick Reference
Probe a File
# Full probe (JSON output) ffprobe -v quiet -print_format json -show_format -show_streams input.mp4 # Quick summary ffprobe -v error -show_entries format=duration,bit_rate -show_entries stream=codec_name,width,height,r_frame_rate input.mp4
Common Conversions
# MP4 (H.264 + AAC) — universal compatibility ffmpeg -i input.mkv -c:v libx264 -preset medium -crf 23 -c:a aac -b:a 128k output.mp4 # WebM (VP9 + Opus) — web-optimized, smaller files ffmpeg -i input.mp4 -c:v libvpx-vp9 -crf 30 -b:v 0 -c:a libopus -b:a 128k output.webm # H.265/HEVC — 50% smaller than H.264 at same quality ffmpeg -i input.mp4 -c:v libx265 -preset medium -crf 28 -c:a aac -b:a 128k output.mp4 # AV1 — best compression, slow encoding ffmpeg -i input.mp4 -c:v libaom-av1 -crf 30 -b:v 0 -c:a libopus output.mkv # Audio only (MP3) ffmpeg -i input.mp4 -vn -c:a libmp3lame -q:a 2 output.mp3 # Audio only (high quality FLAC) ffmpeg -i input.wav -c:a flac output.flac
Trim / Cut
# Trim from 00:01:30 for 60 seconds (fast, no re-encode) ffmpeg -ss 00:01:30 -i input.mp4 -t 60 -c copy output.mp4 # Trim with re-encode (frame-accurate) ffmpeg -i input.mp4 -ss 00:01:30 -to 00:02:30 -c:v libx264 -c:a aac output.mp4
Scale / Resize
# Scale to 1280x720 (maintain aspect ratio) ffmpeg -i input.mp4 -vf "scale=1280:720:force_original_aspect_ratio=decrease,pad=1280:720:-1:-1" output.mp4 # Scale to 50% ffmpeg -i input.mp4 -vf "scale=iw/2:ih/2" output.mp4 # Scale for social media (vertical 9:16) ffmpeg -i input.mp4 -vf "scale=1080:1920:force_original_aspect_ratio=increase,crop=1080:1920" output.mp4
Filters
# Blur ffmpeg -i input.mp4 -vf "boxblur=5:1" output.mp4 # Brightness/Contrast/Saturation ffmpeg -i input.mp4 -vf "eq=brightness=0.1:contrast=1.2:saturation=1.3" output.mp4 # Speed up 2x (video + audio) ffmpeg -i input.mp4 -vf "setpts=0.5*PTS" -af "atempo=2.0" output.mp4 # Slow down 0.5x ffmpeg -i input.mp4 -vf "setpts=2.0*PTS" -af "atempo=0.5" output.mp4 # Rotate 90 degrees clockwise ffmpeg -i input.mp4 -vf "transpose=1" output.mp4 # Crop (center crop to 640x480) ffmpeg -i input.mp4 -vf "crop=640:480" output.mp4 # Overlay watermark ffmpeg -i input.mp4 -i watermark.png -filter_complex "overlay=W-w-10:H-h-10" output.mp4 # Text overlay ffmpeg -i input.mp4 -vf "drawtext=text='Title':fontsize=48:fontcolor=white:x=(w-text_w)/2:y=50" output.mp4
Concatenation
# Create file list echo "file 'clip1.mp4'" > list.txt echo "file 'clip2.mp4'" >> list.txt echo "file 'clip3.mp4'" >> list.txt # Concat (same codec — fast) ffmpeg -f concat -safe 0 -i list.txt -c copy output.mp4 # Concat (different codecs — re-encode) ffmpeg -f concat -safe 0 -i list.txt -c:v libx264 -c:a aac output.mp4
Thumbnails / Stills
# Single frame at 5 seconds ffmpeg -i input.mp4 -ss 00:00:05 -frames:v 1 thumb.jpg # Thumbnail grid (4x4) ffmpeg -i input.mp4 -vf "select=not(mod(n\,300)),scale=320:180,tile=4x4" -frames:v 1 grid.jpg # Extract frame every N seconds ffmpeg -i input.mp4 -vf "fps=1/10" frames/frame_%04d.jpg
Audio Waveform Visualization
# PNG waveform ffmpeg -i input.mp3 -filter_complex "showwavespic=s=1920x200:colors=white" -frames:v 1 waveform.png # Video waveform ffmpeg -i input.mp3 -filter_complex "showwaves=s=1280x720:mode=line:colors=white" waveform.mp4
Streaming (HLS/DASH)
# HLS output (adaptive bitrate) ffmpeg -i input.mp4 -c:v libx264 -c:a aac \ -hls_time 6 -hls_playlist_type vod \ -hls_segment_filename "segment_%03d.ts" playlist.m3u8 # Multiple quality HLS ffmpeg -i input.mp4 \ -map 0:v -map 0:a -map 0:v -map 0:a \ -c:v libx264 -c:a aac \ -b:v:0 5M -s:v:0 1920x1080 \ -b:v:1 1M -s:v:1 640x360 \ -var_stream_map "v:0,a:0 v:1,a:1" \ -hls_time 6 -hls_playlist_type vod \ -master_pl_name master.m3u8 \ -hls_segment_filename "v%v/seg_%03d.ts" "v%v/playlist.m3u8"
Subtitles
# Burn subtitles into video ffmpeg -i input.mp4 -vf "subtitles=subs.srt" output.mp4 # Extract subtitles ffmpeg -i input.mkv -map 0:s:0 subs.srt
ffmpeg.wasm Integration (Browser)
Setup (from micro-ffmpeg pattern)
// ffmpegSingleton.ts — main-thread singleton (no Comlink) import { FFmpeg } from '@ffmpeg/ffmpeg'; import { fetchFile, toBlobURL } from '@ffmpeg/util'; let ffmpeg: FFmpeg | null = null; export async function getFFmpeg(): Promise<FFmpeg> { if (ffmpeg) return ffmpeg; ffmpeg = new FFmpeg(); await ffmpeg.load({ coreURL: await toBlobURL('/ffmpeg-core.js', 'text/javascript'), wasmURL: await toBlobURL('/ffmpeg-core.wasm', 'application/wasm'), }); return ffmpeg; } // Usage const ff = await getFFmpeg(); await ff.writeFile('input.mp4', await fetchFile(file)); await ff.exec(['-i', 'input.mp4', '-vf', 'scale=640:360', 'output.mp4']); const data = await ff.readFile('output.mp4');
Progress Tracking
ff.on('progress', ({ progress, time }) => { console.log(`${Math.round(progress * 100)}% (${time}μs)`); });
Extract Thumbnails (Browser)
async function extractThumbnail(file: File, timeSeconds: number): Promise<Blob> { const ff = await getFFmpeg(); await ff.writeFile('input', await fetchFile(file)); await ff.exec([ '-i', 'input', '-ss', String(timeSeconds), '-frames:v', '1', '-f', 'image2', 'thumb.jpg' ]); const data = await ff.readFile('thumb.jpg'); return new Blob([data], { type: 'image/jpeg' }); }
Codec Reference
| Codec | Use Case | Quality/Size | Speed |
|---|---|---|---|
| H.264 (libx264) | Universal playback | Good/Medium | Fast |
| H.265 (libx265) | Storage, streaming | Better/Small | Slow |
| VP9 (libvpx-vp9) | Web delivery | Better/Small | Slow |
| AV1 (libaom-av1) | Archive, future web | Best/Smallest | Very slow |
| ProRes (prores_ks) | Editing intermediate | Lossless/Huge | Fast |
| AAC | Universal audio | Good | Fast |
| Opus | Web audio, VoIP | Better at low bitrates | Fast |
| FLAC | Lossless audio | Lossless/Large | Fast |
CRF Quality Guide (H.264)
| CRF | Quality | Use Case |
|---|---|---|
| 0 | Lossless | Archival |
| 18 | Visually lossless | High-quality master |
| 23 | Default | Good balance |
| 28 | Lower quality | Web/mobile delivery |
| 35+ | Very compressed | Thumbnails, previews |
Common Mistakes
| Mistake | Fix |
|---|---|
after (slow seek) | Put before for fast seek |
Missing | Add for web-playable MP4 |
Using with CRF | CRF and bitrate are mutually exclusive |
Forgetting | Required for broad compatibility |
| Concat without same codecs | Re-encode or use filter_complex |
| Not setting thread count | Add for auto |
Gource Pipeline Integration
The
gource-visualizer skill (skills/gource-visualizer/) produces PPM frame streams that pipe through ffmpeg. Key patterns:
# Standard Gource → ffmpeg pipeline (headless with xvfb) xvfb-run -a gource --output-ppm-stream - \ --viewport 1920x1080 -s 0.5 --auto-skip-seconds 1 \ --stop-at-end /path/to/repo | \ ffmpeg -y -r 60 -f image2pipe -vcodec ppm -i - \ -vcodec libx264 -preset medium -crf 18 \ -pix_fmt yuv420p -movflags +faststart output.mp4 # Cinematic preset (from skills/gource-visualizer/configs/preset-cinematic.conf) xvfb-run -a gource --load-config skills/gource-visualizer/configs/preset-cinematic.conf \ --output-ppm-stream - /path/to/repo | \ ffmpeg -y -r 60 -f image2pipe -vcodec ppm -i - \ -vcodec libx264 -preset slow -crf 15 \ -pix_fmt yuv420p -movflags +faststart cinematic.mp4 # Multi-repo combined visualization gource --output-custom-log - /repo1 | sed "s|/|/repo1/|2" > /tmp/combined.log gource --output-custom-log - /repo2 | sed "s|/|/repo2/|2" >> /tmp/combined.log sort -t\| -k1,1 /tmp/combined.log > /tmp/sorted.log xvfb-run -a gource /tmp/sorted.log --output-ppm-stream - | \ ffmpeg -y -r 60 -f image2pipe -vcodec ppm -i - \ -vcodec libx264 -preset medium -crf 18 -pix_fmt yuv420p output.mp4
Gource agents:
skills/gource-visualizer/agents/ — installer, log-generator, renderer, deliverer.
Presets: quick (preview), standard (demo), cinematic (publication), thumbnail (still frame).
When This Skill Activates
- Building ffmpeg commands for media processing
- Video/audio conversion, trimming, concatenation
- Generating thumbnails, waveforms, previews
- Building streaming pipelines (HLS/DASH)
- Working with ffmpeg.wasm in browser contexts
- Codec selection and quality optimization
- Filter graph construction
- Subtitle handling
- Gource visualization pipeline (PPM → MP4)
- Repository history videos