PythonClaw tts
Text-to-speech — convert text to audio using gTTS or edge-tts. Use when: user asks to read text aloud, generate an audio file from text, or create a voiceover. NOT for: speech-to-text/transcription (use Deepgram or whisper), music generation, or audio editing.
install
source · Clone the upstream repo
git clone https://github.com/ericwang915/PythonClaw
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/ericwang915/PythonClaw "$T" && mkdir -p ~/.claude/skills && cp -r "$T/pythonclaw/templates/skills/media/tts" ~/.claude/skills/ericwang915-pythonclaw-tts && rm -rf "$T"
manifest:
pythonclaw/templates/skills/media/tts/SKILL.mdsource content
Text-to-Speech
Convert text to spoken audio files.
When to Use
✅ USE this skill when:
- "Read this text aloud"
- "Generate an audio file of this paragraph"
- "Create a voiceover for this script"
- "Convert this text to speech"
- "Say this in Spanish"
When NOT to Use
❌ DON'T use this skill when:
- Speech-to-text / transcription → use Deepgram or Whisper
- Music generation → use specialised music tools
- Audio editing or effects → use ffmpeg or audio editors
- Playing existing audio files → use system player
Usage
Basic TTS (using gTTS — Google Translate TTS)
python {skill_path}/speak.py "Hello, this is PythonClaw speaking." --output hello.mp3
Options
# Different language python {skill_path}/speak.py "Bonjour le monde" --lang fr # Slow speed python {skill_path}/speak.py "Important message" --slow # Custom output path python {skill_path}/speak.py "Text here" --output ~/audio/speech.mp3
Alternative: edge-tts (higher quality, more voices)
If
edge-tts is installed (pip install edge-tts):
python {skill_path}/speak.py "Hello world" --engine edge --voice en-US-AriaNeural
Available edge-tts voices (examples)
— Female, US English (default)en-US-AriaNeural
— Male, US Englishen-US-GuyNeural
— Female, British Englishen-GB-SoniaNeural
— Female, Chinesezh-CN-XiaoxiaoNeural
— Female, Japaneseja-JP-NanamiNeural
— Female, Germande-DE-KatjaNeural
List all voices:
python -m edge_tts --list-voices
Notes
- gTTS requires internet (uses Google Translate's TTS endpoint)
- edge-tts requires internet (uses Microsoft Edge's TTS service)
- Output format is MP3 by default
- For offline TTS, consider
(lower quality but no network needed)pyttsx3
Resources
| File | Description |
|---|---|
| Multi-engine TTS wrapper (gTTS + edge-tts) |