AgentSkillOS media-processing
Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects, composition), and RMBG (AI-powered background removal). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, removing backgrounds from images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.
git clone https://github.com/ynulihao/AgentSkillOS
T=$(mktemp -d) && git clone --depth=1 https://github.com/ynulihao/AgentSkillOS "$T" && mkdir -p ~/.claude/skills && cp -r "$T/data/skill_seeds/media-processing" ~/.claude/skills/ynulihao-agentskillos-media-processing && rm -rf "$T"
data/skill_seeds/media-processing/SKILL.md
Media Processing Skill
Process video, audio, and images using FFmpeg, ImageMagick, and RMBG CLI tools.
Tool Selection
| Task | Tool | Reason |
|---|---|---|
| Video encoding/conversion | FFmpeg | Native codec support, streaming |
| Audio extraction/conversion | FFmpeg | Direct stream manipulation |
| Image resize/effects | ImageMagick | Optimized for still images |
| Background removal | RMBG | AI-powered, local processing |
| Batch images | ImageMagick | mogrify for in-place edits |
| Video thumbnails | FFmpeg | Frame extraction built-in |
| GIF creation | FFmpeg/ImageMagick | FFmpeg for video, ImageMagick for images |
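The table above can be turned into a simple dispatch on file extension. The following is a minimal sketch only: `pick_tool` is a hypothetical helper, and the extension lists are illustrative assumptions rather than a complete format map. GIF is left out on purpose because the table assigns it to either tool depending on the source.

```shell
# Hypothetical helper: suggest a tool for a file based on its extension.
# The extension lists below are assumptions chosen for illustration.
pick_tool() {
  # Lowercase the extension so "CLIP.MP4" and "clip.mp4" are treated alike.
  case "$(printf '%s' "${1##*.}" | tr '[:upper:]' '[:lower:]')" in
    mp4|mkv|avi|mov|webm)       echo ffmpeg ;;  # video encoding/conversion
    mp3|wav|flac|m4a|ogg)       echo ffmpeg ;;  # audio extraction/conversion
    jpg|jpeg|png|webp|tiff|bmp) echo magick ;;  # still-image work
    *)                          echo unknown; return 1 ;;
  esac
}
```

For example, `pick_tool holiday.MKV` prints `ffmpeg`, while an unrecognized extension prints `unknown` and returns a non-zero status so callers can stop early.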
Installation
    # macOS
    brew install ffmpeg imagemagick
    npm install -g rmbg-cli

    # Ubuntu/Debian
    sudo apt-get install ffmpeg imagemagick
    npm install -g rmbg-cli

    # Verify
    ffmpeg -version && magick -version && rmbg --version
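The chained verify command stops at the first failure, so it does not say which tools are missing. A small sketch that reports each missing command by name is shown below; `missing_tools` is a hypothetical helper, not part of any of the three tools. On systems where the packaged ImageMagick is version 6, the `magick` entry point may be absent and `convert` is provided instead, so it can be worth checking for both.

```shell
# Hypothetical helper: print every listed command that is not on PATH.
# The body runs in a subshell, so its variables do not leak into the caller.
missing_tools() (
  rc=0
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "$tool"
      rc=1
    fi
  done
  # Non-zero exit status when at least one command was not found.
  exit "$rc"
)
```

Typical use would be `missing_tools ffmpeg magick rmbg`, which prints nothing and succeeds when all three are installed.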
Essential Commands
    # Video: Convert/re-encode
    ffmpeg -i input.mkv -c copy output.mp4
    ffmpeg -i input.avi -c:v libx264 -crf 22 -c:a aac output.mp4

    # Video: Extract audio
    ffmpeg -i video.mp4 -vn -c:a copy audio.m4a

    # Image: Convert/resize
    magick input.png output.jpg
    magick input.jpg -resize 800x600 output.jpg

    # Image: Batch resize
    mogrify -resize 800x -quality 85 *.jpg

    # Background removal
    rmbg input.jpg                         # Basic (modnet)
    rmbg input.jpg -m briaai -o output.png # High quality
    rmbg input.jpg -m u2netp -o output.png # Fast
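The two video conversion commands above differ in cost: `-c copy` rewraps the existing streams without re-encoding, while the `libx264` form re-encodes. One way to combine them, sketched here under the assumption that a failed remux means the MP4 container cannot hold the source streams, is to try the copy first and re-encode only as a fallback. `to_mp4` is a hypothetical helper name.

```shell
# Hypothetical helper: convert one file to MP4, preferring a lossless remux.
# Note: -y overwrites an existing output file without asking.
to_mp4() (
  src=$1
  out="${src%.*}.mp4"
  # ffmpeg cannot read and write the same path, so refuse early.
  if [ "$src" = "$out" ]; then
    echo "input is already named $out" >&2
    exit 2
  fi
  # First attempt: copy the streams as-is (fast, no quality loss).
  if ffmpeg -nostdin -y -i "$src" -c copy "$out"; then
    exit 0
  fi
  # Fallback: re-encode to H.264 video and AAC audio.
  ffmpeg -nostdin -y -i "$src" -c:v libx264 -crf 22 -c:a aac "$out"
)
```

Calling `to_mp4 input.mkv` would write `input.mkv`'s streams to `input.mp4`, re-encoding only when the remux attempt returns an error.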
Key Parameters
FFmpeg:
- H.264 codec: `-c:v libx264`
- Quality (0-51, lower=better): `-crf 22`
- Speed/compression balance: `-preset slow`
- Audio codec: `-c:a aac`
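Because the right CRF depends on the material, a common approach is to encode a short clip at several values and compare file size and visual quality. The sketch below assumes that workflow; `crf_sweep` and the `PRESET` environment variable are hypothetical names introduced for illustration, and the output naming scheme is an arbitrary choice.

```shell
# Hypothetical helper: encode one input at several CRF values for comparison.
# Usage: crf_sweep clip.mov 18 23 28   ->  clip_crf18.mp4, clip_crf23.mp4, ...
crf_sweep() (
  src=$1; shift
  base=${src%.*}
  for crf in "$@"; do
    # Accept only non-negative integers, then enforce the 0-51 range.
    case "$crf" in
      ''|*[!0-9]*) echo "invalid crf: $crf" >&2; exit 2 ;;
    esac
    if [ "$crf" -gt 51 ]; then
      echo "crf out of range (0-51): $crf" >&2
      exit 2
    fi
    # PRESET is an assumed override; it defaults to the value listed above.
    ffmpeg -nostdin -y -i "$src" -c:v libx264 -crf "$crf" \
      -preset "${PRESET:-slow}" -c:a aac "${base}_crf${crf}.mp4" || exit 1
  done
)
```

Each value is checked just before its encode, so an invalid value later in the list stops the sweep after the earlier encodes have already run.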
ImageMagick:
- Fit within (maintains aspect): `800x600`
- Fill (may crop): `800x600^`
- JPEG quality: `-quality 85`
- Remove metadata: `-strip`
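The difference between the fit and fill geometries is easiest to see as arithmetic: fit scales by the smaller of the two width/height ratios, fill by the larger. The sketch below computes the expected output size with integer math. `resize_dims` is a hypothetical helper, and ImageMagick's own rounding may differ by a pixel in edge cases.

```shell
# Hypothetical helper: predict output size for "WxH" (fit) or "WxH^" (fill).
# Usage: resize_dims fit|fill SRC_W SRC_H BOX_W BOX_H
resize_dims() (
  mode=$1; w=$2; h=$3; bw=$4; bh=$5
  # Compare the scale factors bw/w and bh/h without fractions:
  # bw/w <= bh/h is equivalent to bw*h <= bh*w.
  if [ $((bw * h)) -le $((bh * w)) ]; then width_is_smaller=1; else width_is_smaller=0; fi
  case "$mode" in
    fit)  use_width=$width_is_smaller ;;          # smaller factor: stays inside the box
    fill) use_width=$((1 - width_is_smaller)) ;;  # larger factor: covers the box
    *)    echo "mode must be fit or fill" >&2; exit 2 ;;
  esac
  # Scale the other dimension by the chosen factor, rounding to nearest.
  if [ "$use_width" -eq 1 ]; then
    echo "${bw}x$(( (h * bw + w / 2) / w ))"
  else
    echo "$(( (w * bh + h / 2) / h ))x${bh}"
  fi
)
```

For a 1600x900 source and an 800x600 box, fit gives 800x450 and fill gives 1067x600. To end up at exactly 800x600, the fill result is usually cropped afterwards, for example with `magick input.jpg -resize 800x600^ -gravity center -extent 800x600 output.jpg`.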
RMBG:
- High quality model: `-m briaai`
- Fast model: `-m u2netp`
- Max resolution: `-r 4096`
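For more than a handful of images, the `-m` and `-o` options listed above can be wrapped in a loop. The following is a rough sketch: `remove_backgrounds` is a hypothetical helper, and it assumes `rmbg` accepts the input path followed by `-m` and `-o` exactly as in the single-file commands shown earlier. Outputs are written as PNG so the transparent background is preserved.

```shell
# Hypothetical helper: run rmbg over many images with one model.
# Usage: remove_backgrounds MODEL OUTDIR image...
remove_backgrounds() (
  model=$1; outdir=$2; shift 2
  mkdir -p "$outdir" || exit 1
  failed=0
  for img in "$@"; do
    name=$(basename "$img")
    # Keep going after a failure so one bad file does not stop the batch.
    if ! rmbg "$img" -m "$model" -o "$outdir/${name%.*}.png"; then
      echo "failed: $img" >&2
      failed=1
    fi
  done
  # Non-zero exit status if any image failed.
  exit "$failed"
)
```

A call such as `remove_backgrounds u2netp cutouts photos/*.jpg` would write one PNG per input into `cutouts/`; `-r 4096` could be added to the `rmbg` line if a resolution cap is wanted.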
References
Detailed guides in `references/`:
- Codecs, quality, hardware acceleration: `ffmpeg-encoding.md`
- HLS/DASH, live streaming: `ffmpeg-streaming.md`
- Filters, complex filtergraphs: `ffmpeg-filters.md`
- Effects, transformations: `imagemagick-editing.md`
- Batch processing, parallel ops: `imagemagick-batch.md`
- AI models, CLI usage: `rmbg-background-removal.md`
- Video optimization, responsive images, GIF creation: `common-workflows.md`
- Error fixes, performance tips: `troubleshooting.md`
- Format support, codec recommendations: `format-compatibility.md`