Skills media-hub

Name: media-hub
Author: wulaosiji

install

source · Clone the upstream repo

git clone https://github.com/wulaosiji/skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/wulaosiji/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/media_hub" ~/.claude/skills/wulaosiji-skills-media-hub && rm -rf "$T"

manifest: media_hub/SKILL.md

source content

Media Hub

Unified audio and video processing, transcription, and understanding.

When to Use

Use this skill when:

Processing audio or video files for transcription or conversion
Generating subtitles or speech-to-text output
Extracting frames or clips from video content
Performing batch media operations or format conversions

Do NOT use this skill if:

The task is creating presentation slides → use pitch-deck-creator
The task requires map or location data → use amap-navigator
You only need simple file copying without media analysis

Typical triggers:

「视频转文字」「音频转录」「生成字幕」
"transcribe video", "convert media format", "audio to text"
"extract video frames", "batch process audio", "媒体转换"

Workflow

Step 1: Ingest Media

Accept input via:

File path to local audio/video
URL to downloadable media
Existing media directory for batch processing

Step 2: Select Operation

Choose the appropriate processing mode:

Transcription — speech-to-text with speaker diarization (optional)
Format Conversion — re-encode to mp3, mp4, wav, webm, etc.
Frame Extraction — capture keyframes at intervals or timestamps
Subtitle Generation — produce SRT/VTT from transcribed audio
Content Understanding — generate summaries or topic extraction from media

Step 3: Configure Parameters

Source language for transcription (auto-detect if not specified)
Output format and quality settings
Target directory for processed files
Batch size (for bulk operations)

Step 4: Execute Processing

Run the relevant pipeline and report progress:

Processing: filename.ext
- Transcoding: 45%
- Transcribing: 20%

Step 5: Deliver Results

Output files to a structured directory:

media-output/
├── transcript.txt / .srt / .vtt
├── converted/
├── frames/
└── summary.md

Guardrails

Anti-patterns

NEVER process copyrighted content without user confirmation
NEVER overwrite original source files; always output to a separate directory
NEVER return extremely long raw transcripts without a summary or table of contents

Output Constraints

Original files remain untouched
Transcripts include timestamps when subtitles are requested
Frame extractions include metadata (timestamp, resolution)
Summaries are concise and structured

Error Handling

Unsupported format: inform user and suggest conversion options
Large files: split into chunks or request shorter segments
Low-quality audio: flag accuracy warnings in transcription output

Related Skills

- amap-navigator — Geotag or route media content when location context is needed

About UniqueClub

This skill is part of the UniqueClub toolkit. 🌐 https://uniqueclub.ai | 📂 https://github.com/wulaosiji/skills