Awesome-openclaw-skills walkie-talkie-2

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

install

source · Clone the upstream repo

git clone https://github.com/sundial-org/awesome-openclaw-skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/sundial-org/awesome-openclaw-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/walkie-talkie-2" ~/.claude/skills/sundial-org-awesome-openclaw-skills-walkie-talkie-2 && rm -rf "$T"

OpenClaw · Install into ~/.openclaw/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/sundial-org/awesome-openclaw-skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/walkie-talkie-2" ~/.openclaw/skills/sundial-org-awesome-openclaw-skills-walkie-talkie-2 && rm -rf "$T"

manifest: skills/walkie-talkie-2/SKILL.md

source content

Walkie-Talkie Mode

This skill automates the voice-to-voice loop on WhatsApp using local transcription and local TTS.

Workflow

Incoming Audio: When a user sends an audio file (`audio/ogg`, Opus-encoded):
- Use `tools/transcribe_voice.sh` to get the text.
- Process the text as a normal user prompt.
Outgoing Response:
- Generate speech for the reply using `bin/sherpa-onnx-tts`.
- Send the resulting `.ogg` file back to the user as a voice note, alongside the text reply (see Constraints).
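
The workflow above can be sketched as a single shell function. This is only an illustration under stated assumptions, not the skill's actual implementation: it assumes `tools/transcribe_voice.sh` takes the audio path as its only argument and prints the transcript on stdout, it borrows the TTS argument order (output file, then text) from the Manual Execution example, and `generate_reply` is a hypothetical stand-in for the agent's normal prompt handling.

```shell
# Sketch of one walkie-talkie turn. The two tool paths come from the skill
# text; their calling conventions are assumptions (see the comments below).
TRANSCRIBE="${TRANSCRIBE:-tools/transcribe_voice.sh}"
TTS="${TTS:-bin/sherpa-onnx-tts}"

# Hypothetical placeholder: the real skill hands the transcript to the agent.
generate_reply() {
  printf 'You said: %s' "$1"
}

# handle_voice_note <incoming-audio> <outgoing-ogg>
# Prints the reply text on stdout and writes the spoken reply to <outgoing-ogg>.
handle_voice_note() {
  # Assumed: the transcriber takes the audio path and prints text on stdout.
  transcript="$("$TRANSCRIBE" "$1")" || return 1
  # Stop early if nothing was recognized.
  [ -n "$transcript" ] || return 1
  reply="$(generate_reply "$transcript")"
  # Argument order (output file, then text) follows the Manual Execution example.
  "$TTS" "$2" "$reply" >/dev/null || return 1
  printf '%s\n' "$reply"
}
```

Calling `handle_voice_note incoming.ogg /tmp/reply.ogg` would then leave both the reply text (on stdout) and the audio file ready to send, which matches the constraint of replying with text and audio together.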

Triggers

User sends an audio message.
User says "activa modo walkie-talkie" ("activate walkie-talkie mode") or "hablemos por voz" ("let's talk by voice").
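
A minimal sketch of how the two triggers might be checked. The function names are hypothetical, and the matching rules (case-insensitive substring match on the phrases, any `audio/*` MIME type counting as an audio message) are assumptions rather than something the skill specifies.

```shell
# is_voice_trigger <message-text>
# Succeeds if the text contains one of the trigger phrases.
# Assumption: matching is case-insensitive and allows surrounding words.
is_voice_trigger() {
  case "$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')" in
    *"activa modo walkie-talkie"*|*"hablemos por voz"*) return 0 ;;
    *) return 1 ;;
  esac
}

# should_use_voice <mime-type> <message-text>
# Succeeds for an audio message or for a text message with a trigger phrase.
should_use_voice() {
  # Assumption: any audio/* MIME type counts as an incoming audio message.
  case "$1" in
    audio/*) return 0 ;;
  esac
  is_voice_trigger "$2"
}
```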

Constraints

Use local tools only (ffmpeg, whisper-cpp, sherpa-onnx-tts).
Maintain a fast response time: real-time factor (RTF, processing time divided by audio duration) below 0.5.
Always reply with BOTH text (for clarity) and audio.
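
To make the RTF constraint concrete: RTF is processing time divided by audio duration, so a 10-second clip processed in 2 seconds has an RTF of 0.2, while the same clip taking 6 seconds (RTF 0.6) would miss the target. The helpers below are a hypothetical sketch for checking measured timings. Whether the skill applies the limit to transcription, synthesis, or the whole loop is not stated, so the caller decides what to time.

```shell
# rtf <processing_seconds> <audio_seconds>
# Prints the real-time factor with two decimals; fails on a non-positive duration.
rtf() {
  awk -v p="$1" -v a="$2" 'BEGIN { if (a <= 0) exit 1; printf "%.2f\n", p / a }'
}

# within_budget <processing_seconds> <audio_seconds> [limit]
# Succeeds when the RTF is strictly below the limit (default 0.5, per the constraint).
within_budget() {
  awk -v p="$1" -v a="$2" -v l="${3:-0.5}" 'BEGIN { exit !(a > 0 && p / a < l) }'
}
```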

Manual Execution (Internal)

To respond with voice manually:

bin/sherpa-onnx-tts /tmp/reply.ogg "Tu mensaje aquí"

Then send `/tmp/reply.ogg` via the `message` tool with the `filePath` parameter.