Learn-skills.dev groq-stt

Transcribe audio files using Groq API (Whisper models). Use when user needs to transcribe audio to text.

install
source · Clone the upstream repo
git clone https://github.com/NeverSight/learn-skills.dev
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/NeverSight/learn-skills.dev "$T" && mkdir -p ~/.claude/skills && cp -r "$T/data/skills-md/958877748/skills/groq-stt" ~/.claude/skills/neversight-learn-skills-dev-groq-stt && rm -rf "$T"
manifest: data/skills-md/958877748/skills/groq-stt/SKILL.md
source content

Groq STT Skill

This skill uploads an audio file to the Groq Speech-to-Text API and saves the transcription.

Usage

# set your API key (or use .env in the repo root)
export GROQ_API_KEY=your_api_key_here

# run the script with a path to an audio file
node scripts/transcribe.mjs /path/to/audio.mp4

Output

  • Writes a
    {filename}_transcript.txt
    next to the input file.

Notes

  • Uses the
    whisper-large-v3-turbo
    model by default.
  • Supported file types: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm