Skills qwen-asr

Transcribe audio files using Qwen ASR (千问STT). Use when the user sends voice messages and wants them converted to text.

install
source · Clone the upstream repo
git clone https://github.com/openclaw/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/al-one/qwen-asr" ~/.claude/skills/openclaw-skills-qwen-asr && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/al-one/qwen-asr" ~/.openclaw/skills/openclaw-skills-qwen-asr && rm -rf "$T"
manifest: skills/al-one/qwen-asr/SKILL.md
source content

Qwen ASR

Transcribe an audio file (wav, mp3, ogg, ...) to text using Qwen ASR. No configuration or API key is required.

Usage

uv run scripts/main.py -f audio.wav
cat audio.wav | uv run scripts/main.py > transcript.txt
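
The two commands above handle one file at a time. For a folder of voice messages, a small wrapper can call the same command per file. The following is only a sketch under assumptions: it assumes it is run from the skill directory (so `scripts/main.py` resolves), that `uv` is on the PATH, and that the script prints the transcript to stdout, as the redirect example suggests. The helper names `build_command`, `transcribe`, and `transcribe_dir` are hypothetical and not part of the skill.

```python
import subprocess
from pathlib import Path

# Assumption: the working directory is the skill directory.
SCRIPT = "scripts/main.py"


def build_command(audio_path):
    """Return the argv for the documented `-f` invocation."""
    return ["uv", "run", SCRIPT, "-f", str(audio_path)]


def transcribe(audio_path, runner=subprocess.run):
    """Run the skill on one file and return its stdout with whitespace stripped.

    Assumes the transcript is written to stdout. `runner` is injectable
    so the function can be exercised without uv or network access.
    """
    result = runner(
        build_command(audio_path),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout.strip()


def transcribe_dir(directory, pattern="*.wav", runner=subprocess.run):
    """Write a .txt transcript next to every audio file matching `pattern`."""
    written = []
    for audio in sorted(Path(directory).glob(pattern)):
        out = audio.with_suffix(".txt")
        out.write_text(transcribe(audio, runner=runner) + "\n", encoding="utf-8")
        written.append(out)
    return written
```

Calling `transcribe_dir("voice_messages", pattern="*.ogg")` would then leave one `.txt` file beside each `.ogg` file. The directory name here is only an example.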

About

Qwen ASR is a free, open-source speech-to-text model trained on a large dataset of audio files from the web. It supports multiple languages. This skill is based on the Qwen ASR Demo service (qwen-qwen3-asr-demo.ms.show).