AgentSkillOS media-processing

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects, composition), and RMBG (AI-powered background removal). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, removing backgrounds from images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.

install
source · Clone the upstream repo
git clone https://github.com/ynulihao/AgentSkillOS
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/ynulihao/AgentSkillOS "$T" && mkdir -p ~/.claude/skills && cp -r "$T/data/skill_seeds/media-processing" ~/.claude/skills/ynulihao-agentskillos-media-processing && rm -rf "$T"
manifest: data/skill_seeds/media-processing/SKILL.md
source content

Media Processing Skill

Process video, audio, and images using FFmpeg, ImageMagick, and RMBG CLI tools.

Tool Selection

Task                        | Tool               | Reason
----------------------------|--------------------|----------------------------------------
Video encoding/conversion   | FFmpeg             | Native codec support, streaming
Audio extraction/conversion | FFmpeg             | Direct stream manipulation
Image resize/effects        | ImageMagick        | Optimized for still images
Background removal          | RMBG               | AI-powered, local processing
Batch images                | ImageMagick        | mogrify for in-place edits
Video thumbnails            | FFmpeg             | Frame extraction built-in
GIF creation                | FFmpeg/ImageMagick | FFmpeg for video, ImageMagick for images

Installation

# macOS
brew install ffmpeg imagemagick
npm install -g rmbg-cli

# Ubuntu/Debian
sudo apt-get install ffmpeg imagemagick
npm install -g rmbg-cli

# Verify (ImageMagick 6, still packaged by some distributions, provides `convert` instead of `magick`)
ffmpeg -version && magick -version && rmbg --version
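
The one-line verify step stops at the first missing tool without saying which one failed. A minimal sketch of a per-tool check follows; the function names `check_tools` and `imagemagick_cmd` are hypothetical helpers, not part of the skill.

```shell
# Report whether each CLI named in the arguments is on PATH.
# Prints "ok <name>" or "missing <name>" per tool; returns 1 if any is missing.
check_tools() {
  rc=0
  for tool in "$@"; do
    if command -v "$tool" >/dev/null 2>&1; then
      printf 'ok %s\n' "$tool"
    else
      printf 'missing %s\n' "$tool"
      rc=1
    fi
  done
  return "$rc"
}

# Pick the ImageMagick entry point: `magick` (v7) if present, else `convert` (v6).
# Prints nothing and returns 1 when neither is installed.
imagemagick_cmd() {
  if command -v magick >/dev/null 2>&1; then
    printf 'magick\n'
  elif command -v convert >/dev/null 2>&1; then
    printf 'convert\n'
  else
    return 1
  fi
}
```

Calling `check_tools ffmpeg magick rmbg` lists the state of all three tools in one pass, and `imagemagick_cmd` can be used to choose the right binary name in scripts that must run on both ImageMagick 6 and 7.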

Essential Commands

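# Video: Convert/re-encode
ffmpeg -i input.mkv -c copy output.mp4                        # Remux only: copies streams without re-encoding
ffmpeg -i input.avi -c:v libx264 -crf 22 -c:a aac output.mp4  # Re-encode to H.264 video and AAC audio

# Video: Extract audio
ffmpeg -i video.mp4 -vn -c:a copy audio.m4a  # Stream copy; assumes the audio codec fits an .m4a container (e.g. AAC)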

# Image: Convert/resize
magick input.png output.jpg
magick input.jpg -resize 800x600 output.jpg

# Image: Batch resize (mogrify overwrites the originals in place; work on copies)
mogrify -resize 800x -quality 85 *.jpg

# Background removal
rmbg input.jpg                          # Basic (modnet)
rmbg input.jpg -m briaai -o output.png  # High quality
rmbg input.jpg -m u2netp -o output.png  # Fast
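
The tool table lists video thumbnails and GIF creation under FFmpeg, but the commands above do not show either. The sketch below is one possible way to wrap them; `run`, `video_thumbnail`, `video_to_gif`, and the `DRY_RUN` variable are hypothetical names, and the 10 fps / 480 px values are illustrative defaults rather than recommendations from the skill.

```shell
# Run a command, or only print it when DRY_RUN=1 (handy for checking flags first).
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Grab a single frame at a timestamp as a thumbnail.
# Usage: video_thumbnail <input> <timestamp> <output>
video_thumbnail() {
  run ffmpeg -ss "$2" -i "$1" -frames:v 1 "$3"
}

# Convert a clip to a GIF at 10 fps, 480 px wide, height scaled to keep aspect.
# Usage: video_to_gif <input> <output>
video_to_gif() {
  run ffmpeg -i "$1" -vf "fps=10,scale=480:-1" "$2"
}
```

Setting `DRY_RUN=1` before calling `video_thumbnail video.mp4 00:00:05 thumb.jpg` prints the FFmpeg command line instead of running it.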

Key Parameters

FFmpeg:

  • -c:v libx264
    - H.264 codec
  • -crf 22
    - Quality (0-51, lower=better)
  • -preset slow
    - Speed/compression balance
  • -c:a aac
    - Audio codec
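
The FFmpeg parameters above are usually combined in one command. As a rough sketch, a small helper can assemble them and reject a CRF outside the 0-51 range; `h264_args` is a hypothetical name, and the `medium` fallback is x264's own default preset.

```shell
# Print libx264/AAC encoding flags for a given CRF and preset.
# Usage: h264_args <crf 0-51> [preset]
# Returns 1 (with a message on stderr) if the CRF is not an integer in 0-51.
h264_args() {
  crf=$1
  preset=${2:-medium}
  case $crf in
    ''|*[!0-9]*)
      echo "crf must be a non-negative integer" >&2
      return 1
      ;;
  esac
  if [ "$crf" -gt 51 ]; then
    echo "crf must be between 0 and 51" >&2
    return 1
  fi
  printf '%s\n' "-c:v libx264 -crf $crf -preset $preset -c:a aac"
}
```

The output is meant to be word-split on purpose, for example `ffmpeg -i input.avi $(h264_args 22 slow) output.mp4`.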

ImageMagick:

  • 800x600
    - Fit within (maintains aspect)
  • 800x600^
    - Fill: cover the area while keeping aspect (add -gravity center -extent 800x600 to crop to the exact size)
  • -quality 85
    - JPEG quality
  • -strip
    - Remove metadata
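
The difference between the plain geometry and the `^` form is easiest to see with numbers. The sketch below computes the size each mode would aim for using integer arithmetic; `resize_dims` is a hypothetical helper, and ImageMagick's own rounding may differ by a pixel.

```shell
# Compute the target size for a "fit" (WxH) or "fill" (WxH^) resize.
# Usage: resize_dims <src_w> <src_h> <box_w> <box_h> <fit|fill>
resize_dims() {
  src_w=$1 src_h=$2 box_w=$3 box_h=$4 mode=$5
  # Compare the scale factors box_w/src_w and box_h/src_h by cross-multiplying,
  # which avoids fractions in integer-only shell arithmetic.
  if [ $((box_w * src_h)) -le $((box_h * src_w)) ]; then
    limit=width     # the width scale is the smaller (or equal) one
  else
    limit=height
  fi
  # "fit" uses the smaller scale factor; "fill" uses the larger, so flip the side.
  if [ "$mode" = fill ]; then
    if [ "$limit" = width ]; then limit=height; else limit=width; fi
  fi
  if [ "$limit" = width ]; then
    w=$box_w
    h=$(( (src_h * box_w + src_w / 2) / src_w ))   # rounded to nearest
  else
    h=$box_h
    w=$(( (src_w * box_h + src_h / 2) / src_h ))   # rounded to nearest
  fi
  printf '%sx%s\n' "$w" "$h"
}

# resize_dims 1600 900 800 600 fit    prints 800x450  (fits inside the box)
# resize_dims 1600 900 800 600 fill   prints 1067x600 (covers the box, overflows in width)
```

In the fill case the 1067 px width overflows the 800 px box, which is why a crop step such as `-gravity center -extent 800x600` is needed to get an exact 800x600 result.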

RMBG:

  • -m briaai
    - High quality model
  • -m u2netp
    - Fast model
  • -r 4096
    - Max resolution
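
RMBG is shown on one file at a time, so batch use needs a loop. A minimal sketch, using only the `-m` and `-o` flags listed above: `remove_backgrounds`, the `-nobg.png` suffix, and the `RMBG_BIN` override are all hypothetical choices made for illustration.

```shell
# Remove backgrounds from several images with one model.
# Usage: remove_backgrounds <model> <file>...
# RMBG_BIN is a hypothetical override so the loop can be dry-run with `echo`.
remove_backgrounds() {
  model=$1
  shift
  for src in "$@"; do
    # Skip unmatched globs and missing files.
    [ -f "$src" ] || continue
    # photo.jpg becomes photo-nobg.png next to the original.
    out="${src%.*}-nobg.png"
    "${RMBG_BIN:-rmbg}" "$src" -m "$model" -o "$out" || return 1
  done
}
```

For example, `remove_backgrounds briaai *.jpg` would process every JPEG in the current directory, and setting `RMBG_BIN=echo` first prints the commands without running the model.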

References

Detailed guides in references/:

  • ffmpeg-encoding.md
    - Codecs, quality, hardware acceleration
  • ffmpeg-streaming.md
    - HLS/DASH, live streaming
  • ffmpeg-filters.md
    - Filters, complex filtergraphs
  • imagemagick-editing.md
    - Effects, transformations
  • imagemagick-batch.md
    - Batch processing, parallel ops
  • rmbg-background-removal.md
    - AI models, CLI usage
  • common-workflows.md
    - Video optimization, responsive images, GIF creation
  • troubleshooting.md
    - Error fixes, performance tips
  • format-compatibility.md
    - Format support, codec recommendations