Voice Transcribe OpenClaw Skill - ClawHub

Do you want your AI agent to automate Voice Transcribe workflows? This free skill from ClawHub helps with speech & transcription tasks without building custom tools from scratch.

What this skill does

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

Install

npx clawhub@latest install voice-transcribe

Full SKILL.md

Open original

name	description
voice-transcribe	Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

voice-transcribe

transcribe audio files using openai's gpt-4o-mini-transcribe model.

when to use

when receiving voice memos (especially via whatsapp), just run:

uv run /Users/darin/clawd/skills/voice-transcribe/transcribe <audio-file>

then respond based on the transcribed content.

fixing transcription errors

if darin says a word was transcribed wrong, add it to vocab.txt (for hints) or replacements.txt (for guaranteed fix). see sections below.

supported formats

mp3, mp4, mpeg, mpga, m4a, wav, webm, ogg, opus

examples

# transcribe a voice memo
transcribe /tmp/voice-memo.ogg

# pipe to other tools
transcribe /tmp/memo.ogg | pbcopy

setup

add your openai api key to /Users/darin/clawd/skills/voice-transcribe/.env:
```
OPENAI_API_KEY=sk-...
```

custom vocabulary

add words to vocab.txt (one per line) to help the model recognize names/jargon:

Clawdis
Clawdbot

text replacements

if the model still gets something wrong, add a replacement to replacements.txt:

wrong spelling -> correct spelling

notes

assumes english (no language detection)
uses gpt-4o-mini-transcribe model specifically
caches by sha256 of audio file

Voice Transcribe OpenClaw Skill - ClawHub

What this skill does

Install

Full SKILL.md

voice-transcribe

when to use

fixing transcription errors

supported formats

examples

setup

custom vocabulary

text replacements

notes

Related skills

addis-assistant-stt

agent-voice

announcer

assemblyai-transcribe

audio-gen

audio-reply