Local Vosk
Local speech-to-text using Vosk.
- Rating: 4.2 (149 reviews)
- Downloads: 15,950
- Version: 1.0.0
Complete Documentation
Local Vosk STT
Lightweight local speech-to-text using Vosk. Fully offline after model download.
Use Cases
- Telegram voice messages — transcribe .ogg voice notes automatically
- Audio files — any format ffmpeg supports
- Offline transcription — no API keys, no cloud, no costs
Quick Start
```bash
# Transcribe a Telegram voice message
./skills/local-vosk/scripts/transcribe voice_message.ogg
# Transcribe any audio file
./skills/local-vosk/scripts/transcribe audio.mp3
# With language (default: en-us)
./skills/local-vosk/scripts/transcribe audio.wav --lang en-us
```
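For a folder of saved voice notes, the single-file commands above can be wrapped in a loop. The following is a minimal sketch, not part of the skill itself: the `transcribe_dir` function and the `TRANSCRIBE` variable are hypothetical names, and the sketch assumes the script prints the transcript to standard output.

```bash
# Hypothetical helper: transcribe every .ogg file in a directory.
# TRANSCRIBE defaults to the script path from Quick Start; override it
# if the skill is installed somewhere else.
TRANSCRIBE="${TRANSCRIBE:-./skills/local-vosk/scripts/transcribe}"

transcribe_dir() {
  td_dir="$1"
  td_lang="${2:-en-us}"
  for td_file in "$td_dir"/*.ogg; do
    # Skip the literal pattern when the glob matches nothing.
    [ -e "$td_file" ] || continue
    # Assumption: the transcript is written to stdout.
    "$TRANSCRIBE" "$td_file" --lang "$td_lang" > "${td_file%.ogg}.txt"
  done
}
```

Calling `transcribe_dir ~/voice-notes` would then leave one `.txt` file next to each `.ogg` file, if the stdout assumption holds.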
Supported Formats
Any format ffmpeg can decode: ogg (Telegram), mp3, wav, m4a, webm, flac, etc.
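Vosk models generally expect 16 kHz mono 16-bit PCM audio, which is why ffmpeg is involved. The sketch below shows one way to do that conversion by hand, for example to inspect what the recognizer receives. The function name `to_vosk_wav` is hypothetical, and whether the skill's script uses these exact ffmpeg options is an assumption.

```bash
# Hypothetical helper: convert any ffmpeg-decodable file to 16 kHz mono WAV.
to_vosk_wav() {
  # -ar 16000       resample to 16 kHz
  # -ac 1           downmix to mono
  # -c:a pcm_s16le  encode as 16-bit little-endian PCM
  # -y              overwrite the output file if it exists
  ffmpeg -loglevel error -y -i "$1" -ar 16000 -ac 1 -c:a pcm_s16le "$2"
}
```

Usage would look like `to_vosk_wav voice_message.ogg voice_message.wav`.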
Models
Default model: vosk-model-small-en-us-0.15 (~40MB)
Other models available at https://alphacephei.com/vosk/models
Setup (if not installed)
```bash
pip3 install vosk --user --break-system-packages
# Download model
mkdir -p ~/vosk-models && cd ~/vosk-models
wget https://alphacephei.com/vosk/models/vosk-model-small-en-us-0.15.zip
unzip vosk-model-small-en-us-0.15.zip
```
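After running the setup steps, a quick check can confirm that both pieces are in place. This is a sketch under two assumptions: that the archive unpacks into a directory named after the model, and that the skill looks for models under `~/vosk-models`. The function name `check_vosk_setup` is hypothetical.

```bash
# Hypothetical helper: verify the model directory and the Python package.
check_vosk_setup() {
  # Assumption: unzip created a directory named after the model.
  cv_model_dir="${1:-$HOME/vosk-models/vosk-model-small-en-us-0.15}"
  if [ ! -d "$cv_model_dir" ]; then
    echo "missing model: $cv_model_dir"
    return 1
  fi
  # The import fails if the pip3 install step did not succeed.
  if ! python3 -c 'import vosk' 2>/dev/null; then
    echo "vosk Python package not importable"
    return 1
  fi
  echo "ok"
}
```

Running `check_vosk_setup` with no arguments checks the default model location; pass a path to check a different model.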
Notes
- Quality is good for conversational speech
- For higher accuracy, use larger models or faster-whisper
- Processes audio at ~10x realtime on typical hardware
- Telegram voice messages are .ogg format — works out of the box
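The ~10x realtime figure gives a rough way to budget processing time: divide the audio duration by the speedup. The sketch below does that arithmetic with integer rounding up. The function name is hypothetical, and the default factor of 10 is only the approximate figure from the notes above, so actual times will vary with hardware and model size.

```bash
# Hypothetical helper: estimate processing time in whole seconds.
# $1 = audio duration in seconds, $2 = speedup factor (default 10).
estimate_transcribe_seconds() {
  et_duration="$1"
  et_speedup="${2:-10}"
  # Integer ceiling division: (a + b - 1) / b
  echo $(( (et_duration + et_speedup - 1) / et_speedup ))
}
```

For example, `estimate_transcribe_seconds 60` prints `6`, so a one-minute voice note should take roughly six seconds at the quoted rate.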
Installation
```bash
openclaw install local-vosk
```
Tags
#devops_and-cloud
Quick Info
- Category: Development
- Model: Claude 3.5
- Complexity: One-Click
- Author: sfkiwi
- Last Updated: 3/10/2026