Pi-skills transcribe

Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.

install

source · Clone the upstream repo

git clone https://github.com/badlogic/pi-skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/badlogic/pi-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/transcribe" ~/.claude/skills/badlogic-pi-skills-transcribe && rm -rf "$T"

manifest: transcribe/SKILL.md

source content

Transcribe

Speech-to-text using Groq Whisper API.

Setup

The script needs

GROQ_API_KEY

environment variable. Check if already set:

echo $GROQ_API_KEY

If not set, guide the user through setup:

Ask if they have a Groq API key
If not, have them sign up at https://console.groq.com/ and create an API key

Have them add to their shell profile (~/.zshrc or ~/.bashrc):

export GROQ_API_KEY="<their-api-key>"

Then run
```
source ~/.zshrc
```
(or restart terminal)

Usage

{baseDir}/transcribe.sh <audio-file>

Supported Formats

m4a, mp3, wav, ogg, flac, webm
Max file size: 25MB

Output

Returns plain text transcription with punctuation and proper capitalization to stdout.