Skills audio-transcribe

Name: audio-transcribe
Author: openclaw

Audio Transcription Skill

install

source · Clone the upstream repo

git clone https://github.com/openclaw/skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/aktheknight/audio-transcribe" ~/.claude/skills/openclaw-skills-audio-transcribe && rm -rf "$T"

OpenClaw · Install into ~/.openclaw/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/aktheknight/audio-transcribe" ~/.openclaw/skills/openclaw-skills-audio-transcribe && rm -rf "$T"

manifest: skills/aktheknight/audio-transcribe/SKILL.md

Audio Transcription Skill

Auto-transcribe voice messages using faster-whisper (local, no API key needed).

Requirements

pip install faster-whisper

Models download automatically on first use.

Usage

Transcribe a file

python3 /root/clawd/skills/audio-transcribe/scripts/transcribe.py /path/to/audio.ogg

Change model (edit script)

Edit

transcribe.py

and change:

model = WhisperModel('small', device='cpu', compute_type='int8')  # Options: tiny, base, small, medium, large-v3

Models

Model	Size	VRAM/RAM	Speed	Use Case
tiny	39 MB	~1 GB	⚡⚡⚡	Quick drafts
base	74 MB	~1 GB	⚡⚡	Basic accuracy
small	244 MB	~2 GB	⚡	Recommended
medium	769 MB	~5 GB	🐢	Better accuracy
large-v3	1.5 GB	~10 GB	🐢🐢	Best accuracy

Integration

Clawdbot auto-transcribes incoming voice messages when this skill is enabled.

Files

```
scripts/transcribe.py
```
— Main transcription script
```
SKILL.md
```
— This file