Vocal Chat OpenClaw Skill - ClawHub

Do you want your AI agent to automate Vocal Chat workflows? This free skill from ClawHub helps with browser & automation tasks without building custom tools from scratch.

What this skill does

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

Install

npx clawhub@latest install vocal-chat

Full SKILL.md

name: walkie-talkie
description: Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

Walkie-Talkie Mode

This skill automates the voice-to-voice loop on WhatsApp using local transcription and local TTS.

Workflow

Incoming Audio: When a user sends an audio/ogg/opus file:
- Use tools/transcribe_voice.sh to get the text.
- Process the text as a normal user prompt.
Outgoing Response:
- Generate speech from the reply using bin/sherpa-onnx-tts, rather than replying with text alone.
- Send the resulting .ogg file back to the user as a voice note.
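
The two workflow steps can be sketched as one shell function. This is a minimal sketch under assumptions, not the skill's actual implementation: it assumes tools/transcribe_voice.sh takes the audio path as its only argument and prints the transcript on stdout, and generate_reply is a hypothetical placeholder for the agent's normal prompt handling. The argument order for bin/sherpa-onnx-tts (output file, then text) follows the manual command shown under Manual Execution.

```shell
#!/usr/bin/env bash
# Sketch of one walkie-talkie turn: transcribe, answer, synthesize.
# ASSUMPTION: the transcription script accepts the audio path as $1
# and writes the transcript to stdout. Both paths can be overridden.
TRANSCRIBE="${TRANSCRIBE:-tools/transcribe_voice.sh}"
TTS="${TTS:-bin/sherpa-onnx-tts}"

# Hypothetical stand-in for the agent's normal prompt handling.
generate_reply() {
  printf 'You said: %s' "$1"
}

# handle_voice_note IN.ogg OUT.ogg
# Writes the spoken reply to OUT.ogg and prints the reply text,
# so the caller can send both text and audio.
handle_voice_note() {
  local audio_in="$1" audio_out="$2"
  local transcript reply
  transcript="$("$TRANSCRIBE" "$audio_in")" || return 1
  reply="$(generate_reply "$transcript")" || return 1
  "$TTS" "$audio_out" "$reply" || return 1
  printf '%s\n' "$reply"
}
```

A caller would run something like handle_voice_note /tmp/incoming.ogg /tmp/reply.ogg, then send the printed text and /tmp/reply.ogg together.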

Triggers

User sends an audio message.
User says "activa modo walkie-talkie" ("activate walkie-talkie mode") or "hablemos por voz" ("let's talk by voice").
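
The phrase trigger could be checked with a small helper like the one below. It is only a sketch: the function name is_voice_trigger is hypothetical, and matching the phrases case-insensitively anywhere in the message is an assumption, since the skill does not say how strictly the phrases must match.

```shell
#!/usr/bin/env bash
# Returns 0 when the message contains one of the two activation phrases.
# ASSUMPTION: case-insensitive substring matching is acceptable.
is_voice_trigger() {
  local text
  # Lowercase the message so "Hablemos Por Voz" also matches.
  text="$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')"
  case "$text" in
    *"activa modo walkie-talkie"*|*"hablemos por voz"*) return 0 ;;
    *) return 1 ;;
  esac
}
```

Incoming audio messages need no phrase check, because an audio attachment is itself a trigger.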

Constraints

Use local tools only (ffmpeg, whisper-cpp, sherpa-onnx-tts).
Maintain a fast response time: keep the real-time factor (RTF, processing time divided by audio duration) below 0.5.
Always reply with BOTH text (for clarity) and audio.
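
The RTF budget can be checked with a little arithmetic. The sketch below assumes the usual definition of real-time factor (seconds spent processing divided by seconds of audio) and uses hypothetical helper names. The skill does not say which stage the 0.5 limit applies to, so the helpers take the timings as plain arguments.

```shell
#!/usr/bin/env bash
# rtf PROCESSING_SECONDS AUDIO_SECONDS
# Prints the real-time factor with three decimals.
rtf() {
  awk -v p="$1" -v a="$2" 'BEGIN { if (a <= 0) exit 1; printf "%.3f\n", p / a }'
}

# rtf_within_budget PROCESSING_SECONDS AUDIO_SECONDS [MAX]
# Returns 0 when the RTF is strictly below MAX (default 0.5).
rtf_within_budget() {
  awk -v p="$1" -v a="$2" -v max="${3:-0.5}" \
    'BEGIN { exit !(a > 0 && p / a < max) }'
}
```

For example, 1.2 seconds of processing for a 4-second voice note gives an RTF of 0.300, which is within the budget, while 3 seconds for the same note gives 0.750, which is not.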

Manual Execution (Internal)

To respond with voice manually:

bin/sherpa-onnx-tts /tmp/reply.ogg "Tu mensaje aquí"

Then send /tmp/reply.ogg with the message tool, passing the path as filePath.
