Walkie Talkie OpenClaw Skill - ClawHub
Do you want your AI agent to automate Walkie Talkie workflows? This free skill from ClawHub helps with communication tasks without building custom tools from scratch.
What this skill does
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
Install
npx clawhub@latest install walkie-talkieFull SKILL.md
Open original| name | description |
|---|---|
| walkie-talkie | Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type. |
Walkie-Talkie Mode
This skill automates the voice-to-voice loop on WhatsApp using local transcription and local TTS.
Workflow
-
Incoming Audio: When a user sends an audio/ogg/opus file:
- Use
tools/transcribe_voice.shto get the text. - Process the text as a normal user prompt.
- Use
-
Outgoing Response:
- Instead of a text reply, generate speech using
bin/sherpa-onnx-tts. - Send the resulting
.oggfile back to the user as a voice note.
- Instead of a text reply, generate speech using
Triggers
- User sends an audio message.
- User says "activa modo walkie-talkie" or "hablemos por voz".
Constraints
- Use local tools only (ffmpeg, whisper-cpp, sherpa-onnx-tts).
- Maintain a fast response time (RTF < 0.5).
- Always reply with BOTH text (for clarity) and audio.
Manual Execution (Internal)
To respond with voice manually:
bin/sherpa-onnx-tts /tmp/reply.ogg "Tu mensaje aquí"
Then send /tmp/reply.ogg via message tool with filePath.