Joelclaw video-ingest
Download, transcribe, and summarize videos via the Inngest pipeline. Use when the user asks to grab/download/transcribe/ingest a video, save a YouTube video, or process any video URL. Also handles batch ingest of multiple URLs. This skill triggers the durable Inngest workflow — do NOT run yt-dlp, mlx-whisper, or scp manually.
git clone https://github.com/joelhooks/joelclaw
T=$(mktemp -d) && git clone --depth=1 https://github.com/joelhooks/joelclaw "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/video-ingest" ~/.claude/skills/joelhooks-joelclaw-video-ingest && rm -rf "$T"
skills/video-ingest/SKILL.mdVideo Ingest — via Inngest Pipeline
Videos are ingested through the Inngest event bus. Do not run yt-dlp, mlx-whisper, scp, or create vault notes manually. The pipeline handles everything: download → NAS transfer → transcription → vault note → summary enrichment.
Quick Start
joelclaw send pipeline/video.download -d '{"url":"URL_HERE"}'
That's it. The event chain handles the rest.
Alternative (raw curl):
curl -s -X POST "http://localhost:8288/e/37aa349b89692d657d276a40e0e47a15" \ -H "Content-Type: application/json" \ -d '{"name":"pipeline/video.download","data":{"url":"URL_HERE"}}'
Pipeline Flow
pipeline/video.download — you send this ↓ video-download function — yt-dlp → /tmp → NAS transfer ↓ emits pipeline/video.downloaded — logged by system-logger pipeline/transcript.process — auto-triggered ↓ transcript-process function — mlx-whisper (M4 Pro, ~5min/hr of video) ↓ emits pipeline/transcript.processed — logged content/summarize — auto-triggered ↓ content-summarize function — pi enrichment → vault note with summary ↓ emits content/summarized — logged, done
Before Sending: Health Check
Always verify the pipeline is healthy before sending events:
# Inngest server curl -s http://localhost:8288/health # Worker (should show functions including video-download) curl -s http://localhost:3111/ | python3 -c " import json,sys d=json.load(sys.stdin) fns = [f.get('id','?') for f in d.get('functions',[])] print(f'Worker OK — {len(fns)} functions: {', '.join(fns)}') " # Docker container running docker ps --filter ancestor=inngest/inngest --format "table {{.Status}}\t{{.Ports}}"
If the worker is down:
kubectl -n joelclaw rollout restart deployment/system-bus-worker kubectl -n joelclaw rollout status deployment/system-bus-worker --timeout=180s joelclaw refresh
Monitoring a Run
Watch progress in real-time
# Worker logs — shows step execution + failures kubectl logs -n joelclaw deploy/system-bus-worker -f # Docker logs — shows event dispatch docker logs -f $(docker ps -q --filter ancestor=inngest/inngest) 2>&1 | grep -v DEBUG
Check if events fired
# Look for the video's events in Docker logs docker logs $(docker ps -q --filter ancestor=inngest/inngest) 2>&1 | grep -i "video\|transcript\|summarize" | tail -20
Dashboard
Open http://localhost:8288 in browser — shows functions, events, runs with per-step traces.
Verify completion
# Check if vault note was created ls -la ~/Vault/Resources/videos/*SLUG* # Check system log for pipeline entries tail -10 ~/Vault/system/system-log.jsonl | grep -i video
Batch Ingest
Send multiple events. Inngest queues and processes them with concurrency control:
joelclaw send pipeline/video.download -d '{"url":"https://youtube.com/watch?v=XXXX"}' joelclaw send pipeline/video.download -d '{"url":"https://youtube.com/watch?v=YYYY"}' joelclaw send pipeline/video.download -d '{"url":"https://youtube.com/watch?v=ZZZZ"}'
Manual Transcript (Non-YouTube)
For audio files already on disk, or raw text from Granola/Fathom:
# From audio file joelclaw send pipeline/transcript.process -d '{"source":"manual","audioPath":"/path/to/audio.mp4","title":"Recording Title","slug":"recording-title"}' # From raw text (Granola, Fathom, etc.) joelclaw send pipeline/transcript.process -d '{"source":"granola","text":"transcript text...","title":"Meeting Title","slug":"meeting-title"}'
Re-run Summary Only
If the vault note exists but needs a better summary:
joelclaw send content/summarize -d '{"vaultPath":"/Users/joel/Vault/Resources/videos/SLUG.md"}'
Options
| Field | Default | Description |
|---|---|---|
| required | YouTube or video URL |
| | Max video resolution: , , |
Where Things End Up
| What | Location |
|---|---|
| Video + metadata | NAS: |
| Vault note | |
| Daily note link | under |
| System log entry | |
Troubleshooting
If events are accepted (200 OK) but nothing happens:
-
Check Docker→worker connectivity — the most common issue:
docker logs $(docker ps -q --filter ancestor=inngest/inngest) 2>&1 | grep -E "ERROR|Unable" | tail -5If you see "Unable to reach SDK URL" → see the inngest skill's serveHost gotcha.
-
Check worker is actually running:
kubectl get deploy -n joelclaw system-bus-worker kubectl get pods -n joelclaw -l app=system-bus-worker -
Check worker errors for the specific function:
kubectl logs -n joelclaw deploy/system-bus-worker --tail=80 -
Use inngest-debug skill for deep inspection of specific run IDs via GraphQL.
What NOT to Do
- ❌ Don't run
directly — the pipeline handles download + NAS transferyt-dlp - ❌ Don't run
directly — the pipeline handles transcriptionmlx_whisper - ❌ Don't
to NAS manually — the pipeline handles transferscp - ❌ Don't create vault notes manually — the pipeline creates them with proper frontmatter
- ❌ Don't use codex/background tasks for video processing — Inngest is durable and has retries