Pi-skills transcribe
Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.
install
source · Clone the upstream repo
git clone https://github.com/badlogic/pi-skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/badlogic/pi-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/transcribe" ~/.claude/skills/badlogic-pi-skills-transcribe && rm -rf "$T"
manifest:
transcribe/SKILL.mdsource content
Transcribe
Speech-to-text using Groq Whisper API.
Setup
The script needs
GROQ_API_KEY environment variable. Check if already set:
echo $GROQ_API_KEY
If not set, guide the user through setup:
- Ask if they have a Groq API key
- If not, have them sign up at https://console.groq.com/ and create an API key
- Have them add to their shell profile (~/.zshrc or ~/.bashrc):
export GROQ_API_KEY="<their-api-key>" - Then run
(or restart terminal)source ~/.zshrc
Usage
{baseDir}/transcribe.sh <audio-file>
Supported Formats
- m4a, mp3, wav, ogg, flac, webm
- Max file size: 25MB
Output
Returns plain text transcription with punctuation and proper capitalization to stdout.