Skills media-hub

install
source · Clone the upstream repo
git clone https://github.com/wulaosiji/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/wulaosiji/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/media_hub" ~/.claude/skills/wulaosiji-skills-media-hub && rm -rf "$T"
manifest: media_hub/SKILL.md
source content

Media Hub

Unified audio and video processing, transcription, and understanding.

When to Use

Use this skill when:

  • Processing audio or video files for transcription or conversion
  • Generating subtitles or speech-to-text output
  • Extracting frames or clips from video content
  • Performing batch media operations or format conversions

Do NOT use this skill if:

  • The task is creating presentation slides → use pitch-deck-creator
  • The task requires map or location data → use amap-navigator
  • You only need simple file copying without media analysis

Typical triggers:

  • 「视频转文字」「音频转录」「生成字幕」
  • "transcribe video", "convert media format", "audio to text"
  • "extract video frames", "batch process audio", "媒体转换"

Workflow

Step 1: Ingest Media

Accept input via:

  • File path to local audio/video
  • URL to downloadable media
  • Existing media directory for batch processing

Step 2: Select Operation

Choose the appropriate processing mode:

  • Transcription — speech-to-text with speaker diarization (optional)
  • Format Conversion — re-encode to mp3, mp4, wav, webm, etc.
  • Frame Extraction — capture keyframes at intervals or timestamps
  • Subtitle Generation — produce SRT/VTT from transcribed audio
  • Content Understanding — generate summaries or topic extraction from media

Step 3: Configure Parameters

  • Source language for transcription (auto-detect if not specified)
  • Output format and quality settings
  • Target directory for processed files
  • Batch size (for bulk operations)

Step 4: Execute Processing

Run the relevant pipeline and report progress:

Processing: filename.ext
- Transcoding: 45%
- Transcribing: 20%

Step 5: Deliver Results

Output files to a structured directory:

media-output/
├── transcript.txt / .srt / .vtt
├── converted/
├── frames/
└── summary.md

Guardrails

Anti-patterns

  • NEVER process copyrighted content without user confirmation
  • NEVER overwrite original source files; always output to a separate directory
  • NEVER return extremely long raw transcripts without a summary or table of contents

Output Constraints

  • Original files remain untouched
  • Transcripts include timestamps when subtitles are requested
  • Frame extractions include metadata (timestamp, resolution)
  • Summaries are concise and structured

Error Handling

  • Unsupported format: inform user and suggest conversion options
  • Large files: split into chunks or request shorter segments
  • Low-quality audio: flag accuracy warnings in transcription output

Related Skills

    • amap-navigator — Geotag or route media content when location context is needed

About UniqueClub

This skill is part of the UniqueClub toolkit. 🌐 https://uniqueclub.ai | 📂 https://github.com/wulaosiji/skills