Claude-skill-registry-data media-workflow
Complete end-to-end WhatsApp media processing workflow. Processes audio/images AND integrates them into chat.md in a single step. Use when you want to do the complete daily media processing (transcribe + OCR + integrate into chat).
git clone https://github.com/majiayu000/claude-skill-registry-data
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry-data "$T" && mkdir -p ~/.claude/skills && cp -r "$T/data/media-workflow" ~/.claude/skills/majiayu000-claude-skill-registry-data-media-workflow && rm -rf "$T"
data/media-workflow/SKILL.md
Media Workflow
Overview
Automatically runs the complete WhatsApp media processing workflow:
- Process media → Transcribes audio and runs OCR on images
- Integrate → Inserts the processed content into chat.md files
- Report → Summarizes what was processed
What it does:
- Calls gastrohem-media-processor for audio/images
- Calls chat-integrator for integration into chat.md
- All in one step!
Output:
- Audio transcriptions (.json)
- Image summaries (.md)
- Updated chat.md with all media integrated
- Backup chat.md.backup created before changes
When to Use This Skill
User says:
- "Process and integrate media"
- "Do complete media workflow"
- "Process today's WhatsApp media"
- "Process and merge media for 24.10"
Default behavior: Uses today's date, processes all departments
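The default behavior can be sketched as follows. This is an illustration only, not the skill's actual code: the function names are hypothetical, and the `<base>/<department>/<week range>/<DD.MM>` layout is an assumption inferred from the example folder path used elsewhere in this document.

```python
from datetime import date
from pathlib import Path
from typing import List, Optional


def default_scan_date(today: Optional[date] = None) -> str:
    """Return a date as DD.MM, the format accepted by --scan-date."""
    today = today or date.today()
    return today.strftime("%d.%m")


def find_date_folders(base_path: Path, scan_date: str) -> List[Path]:
    """Find every department folder for one date.

    Assumed layout: <base>/<department>/<week range>/<DD.MM>
    """
    matches = base_path.glob(f"*/*/{scan_date}")
    return sorted(p for p in matches if p.is_dir())
```

With no arguments, `default_scan_date()` yields today's date, and `find_date_folders(Path("gastrohem whatsapp"), default_scan_date())` would return one folder per department that has media for that day.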
Workflow
Simple Usage
Process and integrate for today (DEFAULT):
python .claude/skills/media-workflow/scripts/run_workflow.py
Process and integrate for specific date:
python .claude/skills/media-workflow/scripts/run_workflow.py --scan-date 24.10
Process and integrate specific folder:
python .claude/skills/media-workflow/scripts/run_workflow.py --folder "gastrohem whatsapp/administracija/20.10 - 27.10/24.10"
Dry run (preview changes):
python .claude/skills/media-workflow/scripts/run_workflow.py --dry-run
What Happens
Step 1: Media Processing
- Finds all folders for target date/folder
- Transcribes audio files in parallel (3 max)
- Creates .json audio transcriptions
- Performs OCR on images
- Creates .md image summaries
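The "3 max" parallel limit in Step 1 could be implemented with a bounded thread pool. The sketch below is an assumption about how such a limit might look; `transcribe_in_parallel` and the `transcribe` callable are hypothetical names, and the real processor may use a different concurrency mechanism.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Sequence

MAX_PARALLEL = 3  # mirrors the "3 max" limit described in Step 1


def transcribe_in_parallel(audio_files: Sequence[str],
                           transcribe: Callable[[str], str]) -> Dict[str, str]:
    """Run transcribe() on every file, with at most MAX_PARALLEL running at once."""
    with ThreadPoolExecutor(max_workers=MAX_PARALLEL) as pool:
        # pool.map preserves input order, so results line up with audio_files.
        texts = list(pool.map(transcribe, audio_files))
    return dict(zip(audio_files, texts))
```

A thread pool suits this kind of work when each transcription mostly waits on I/O or an external service; CPU-bound transcription would more likely call for a process pool.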
Step 2: Integration
- Extracts timestamps from filenames
- Parses existing chat.md entries
- Merges new + existing entries chronologically
- Creates backup: chat.md.backup
- Writes updated chat.md
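The timestamp extraction and chronological merge in Step 2 might look roughly like this. Both the filename pattern (`WhatsApp Audio 2024-10-24 at 14.30.05.mp3`) and the `Entry` structure are assumptions made for illustration; the document does not specify the filename format or how chat.md entries are represented.

```python
import re
from datetime import datetime
from typing import Iterable, List, NamedTuple, Optional

# Assumed filename shape, e.g. "WhatsApp Audio 2024-10-24 at 14.30.05.mp3".
_STAMP = re.compile(r"(\d{4}-\d{2}-\d{2}) at (\d{2})\.(\d{2})\.(\d{2})")


class Entry(NamedTuple):
    """Hypothetical in-memory form of one chat.md entry."""
    when: datetime
    text: str


def timestamp_from_filename(name: str) -> Optional[datetime]:
    """Return the timestamp embedded in a media filename, or None if absent."""
    match = _STAMP.search(name)
    if match is None:
        return None
    day, hh, mm, ss = match.groups()
    return datetime.strptime(f"{day} {hh}:{mm}:{ss}", "%Y-%m-%d %H:%M:%S")


def merge_entries(existing: Iterable[Entry], new: Iterable[Entry]) -> List[Entry]:
    """Merge chronologically; sorted() is stable, so on equal
    timestamps the existing entries keep their place ahead of new ones."""
    return sorted([*existing, *new], key=lambda entry: entry.when)
```

Files without a recognizable timestamp return `None`, which lets the caller decide whether to skip them or report them.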
Step 3: Report
- Summary of processed media
- List of integrated entries
- Any errors or skipped files
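As a rough illustration of the report step, the sketch below collects counts in a small result object and renders summary lines in the style of the messages shown in this document's example. The `WorkflowResult` fields and the JSON layout are hypothetical; the actual schema written by --output-json is not documented here.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class WorkflowResult:
    """Hypothetical container for the numbers the report needs."""
    audio_count: int = 0
    image_count: int = 0
    folder_count: int = 0
    integrated_entries: int = 0
    errors: List[str] = field(default_factory=list)


def format_report(result: WorkflowResult) -> List[str]:
    """Render the summary lines, followed by one line per error."""
    lines = [
        f"Processed {result.audio_count} audio + {result.image_count} images "
        f"across {result.folder_count} folders",
        f"Integrated {result.integrated_entries} entries into chat.md",
    ]
    lines += [f"Error: {message}" for message in result.errors]
    return lines


def to_json(result: WorkflowResult) -> str:
    """Serialize the result, e.g. for a file passed via --output-json."""
    return json.dumps(asdict(result), ensure_ascii=False, indent=2)
```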
Script Reference
run_workflow.py
Purpose: Master workflow script
Arguments:
- --scan-date DD.MM: Process all departments for this date
- --folder PATH: Process specific folder
- --base-path PATH: Base path (default: "gastrohem whatsapp")
- --dry-run: Preview changes without writing
- --no-backup: Skip backup creation
- --output-json FILE: Save workflow results to JSON
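For readers who want to see how these flags fit together, here is a minimal `argparse` sketch that mirrors the argument list above. It is a reconstruction from the documented flags, not the script's actual parser; the help strings and the `build_parser` name are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser exposing the flags documented for run_workflow.py."""
    parser = argparse.ArgumentParser(
        prog="run_workflow.py",
        description="Process WhatsApp media and integrate it into chat.md",
    )
    parser.add_argument("--scan-date", metavar="DD.MM",
                        help="process all departments for this date")
    parser.add_argument("--folder", metavar="PATH",
                        help="process a specific folder")
    parser.add_argument("--base-path", metavar="PATH",
                        default="gastrohem whatsapp",
                        help="base path of the WhatsApp export")
    parser.add_argument("--dry-run", action="store_true",
                        help="preview changes without writing")
    parser.add_argument("--no-backup", action="store_true",
                        help="skip backup creation")
    parser.add_argument("--output-json", metavar="FILE",
                        help="save workflow results to JSON")
    return parser
```

`argparse` converts the dashes to underscores, so the parsed values are read as `args.scan_date`, `args.dry_run`, and so on.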
What it does:
- Runs gastrohem-media-processor script
- Waits for media processing to complete
- Runs chat-integrator script
- Aggregates and reports results
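The run, wait, run, aggregate sequence above can be sketched with `subprocess.run`, which blocks until each child process exits. The `run_steps` helper is hypothetical, and stopping at the first failing step is an assumption; the commands for the two underlying scripts are passed in by the caller because their paths are not given in this document.

```python
import subprocess
import sys
from typing import Dict, List, Sequence, Tuple


def run_steps(steps: Sequence[Tuple[str, List[str]]]) -> List[Dict[str, object]]:
    """Run each (name, argv) step in order and collect a small report.

    Assumption: a non-zero exit code stops the workflow before later steps.
    """
    report: List[Dict[str, object]] = []
    for name, argv in steps:
        # subprocess.run waits for the process to finish before returning.
        proc = subprocess.run(argv, capture_output=True, text=True)
        report.append({
            "step": name,
            "returncode": proc.returncode,
            "stdout": proc.stdout.strip(),
        })
        if proc.returncode != 0:
            break
    return report
```

A caller would pass something like `[("process", [sys.executable, processor_script]), ("integrate", [sys.executable, integrator_script])]`, where both script variables are placeholders for the real locations.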
Best Practices
- Run daily - Process media daily to avoid backlog
- Use --dry-run first - Preview changes before committing (especially first time)
- Check results - Review integrated content for accuracy
- Backups are automatic - Backup created before any chat.md modifications (unless --no-backup is passed)
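The backup-before-write behavior can be illustrated with a short sketch. The `backup_then_write` helper and its `no_backup` parameter are hypothetical names chosen to match the --no-backup flag; the real integrator may order or name things differently.

```python
import shutil
from pathlib import Path
from typing import Optional


def backup_then_write(chat_path: Path, new_text: str,
                      no_backup: bool = False) -> Optional[Path]:
    """Copy chat.md to chat.md.backup, then overwrite chat.md.

    Returns the backup path, or None when no backup was made.
    """
    backup_path = None
    if not no_backup and chat_path.exists():
        backup_path = chat_path.with_name(chat_path.name + ".backup")
        shutil.copy2(chat_path, backup_path)  # copy happens before any write
    chat_path.write_text(new_text, encoding="utf-8")
    return backup_path
```

Because the copy completes before the write starts, a failure while writing leaves chat.md.backup holding the previous content.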
Error Handling
If audio transcription fails:
- Skill continues with remaining files
- Reports failed files in summary
If image OCR fails:
- Skill continues with remaining files
- Reports failed files in summary
If integration fails:
- Backup remains intact
- Error logged with details
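The "continue with remaining files, report failures in summary" behavior amounts to catching errors per file rather than per batch. A minimal sketch, assuming a hypothetical `process_files` helper and a `handler` callable that stands in for transcription or OCR:

```python
from typing import Callable, Dict, Iterable, List, Tuple


def process_files(files: Iterable[str],
                  handler: Callable[[str], str]
                  ) -> Tuple[Dict[str, str], List[Tuple[str, str]]]:
    """Apply handler to each file; on error, record the failure and continue."""
    done: Dict[str, str] = {}
    failed: List[Tuple[str, str]] = []
    for name in files:
        try:
            done[name] = handler(name)
        except Exception as exc:  # broad on purpose: one bad file must not stop the batch
            failed.append((name, f"{type(exc).__name__}: {exc}"))
    return done, failed
```

The `failed` list carries both the filename and the error text, which is enough to build the "failed files" part of the summary.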
Example Workflow
User: "Process media"
Claude:
- Runs: python .claude/skills/media-workflow/scripts/run_workflow.py
- Step 1: Processes media for today (26.10)
  - Finds 3 folders: administracija/26.10, finansije/26.10, adis-chat/26.10
  - Transcribes 2 audio files → .json
  - OCRs 3 images → .md
- Step 2: Integrates into chat.md
  - Merges 5 entries into 3 chat.md files
  - Creates backups
- Step 3: Reports
  - "Processed 2 audio + 3 images across 3 folders"
  - "Integrated 5 entries into chat.md"
  - "✅ Complete workflow finished successfully"
Performance
- Total time: ~10-15 sec for a typical daily folder
- Media processing: ~5-8 sec (parallel)
- Integration: ~2-3 sec
- Overhead: ~1-2 sec
Output Structure
After running workflow on folder 24.10:
gastrohem whatsapp/administracija/20.10 - 27.10/24.10/
├── chat.md                     # Updated with integrated media
├── chat.md.backup              # Backup before integration
├── image.png
├── image.png.md                # OCR summary
├── WhatsApp Audio *.mp3
└── WhatsApp Audio *.mp3.json   # Audio transcription