✓ Verified
💻 Development
✓ Enhanced Data
Voice Recognition
Local speech-to-text with OpenAI Whisper CLI.
- Rating
- 3.9 (166 reviews)
- Downloads
- 2,632 downloads
- Version
- 1.0.0
Overview
Local speech-to-text with OpenAI Whisper CLI.
✨Key Features
1
Local processing - No API key needed, free
2
Multi-language - Chinese, English, 100+ languages
3
Translation - Translate to English
4
Summarization - Generate quick summary
Complete Documentation
View Source →
Voice Recognition (Whisper)
Local speech-to-text with OpenAI Whisper CLI.
Features
- Local processing - No API key needed, free
- Multi-language - Chinese, English, 100+ languages
- Translation - Translate to English
- Summarization - Generate quick summary
Usage
Basic
bash
# Chinese recognition
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a
# Force Chinese
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --zh
# English recognition
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --en
# Translate to English
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --translate
# With summary
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --summarize
Quick Command (add to ~/.zshrc)
bash
alias voice="python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py"
Then use:
bash
voice ~/Downloads/audio.m4a --zh
Requirements
- OpenAI Whisper CLI:
brew install openai-whisper - Python 3.10+
Files
scripts/voice识别_升级版.py- Main scriptscripts/voice_tool_README.md- Documentation
Supported Formats
- MP3, M4A, WAV, OGG, FLAC, WebM
Language Support
100+ languages including:
- Chinese (zh)
- English (en)
- Japanese (ja)
- Korean (ko)
- And more...
Notes
- Default model:
medium(balance of speed and accuracy) - First run downloads model to
~/.cache/whisper - Processing time varies by audio length and model size
Installation
Terminal bash
openclaw install voice-recognition
Copied!
💻Code Examples
example.sh
# Chinese recognition
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a
# Force Chinese
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --zh
# English recognition
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --en
# Translate to English
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --translate
# With summary
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --summarizeTags
#ai_and-llms
#cli
Quick Info
Category Development
Model GPT-4
Complexity One-Click
Author gykdly
Last Updated 3/10/2026
🚀
Optimized for
GPT-4
Ready to Install?
Get started with this skill in seconds
openclaw install voice-recognition
Related Skills
✓ Verified
💻 Development
4claw
4claw — a moderated imageboard for AI agents.
🧠 Claude-Ready
)}
★ 4.4 (118)
↓ 4,990
v1.0.0
✓ Verified
💻 Development
Aap Passport
Agent Attestation Protocol - The Reverse Turing Test.
🧠 Claude-Ready
)}
★ 4.3 (89)
↓ 4,621
v1.0.0
✓ Verified
💻 Development
Acestep Lyrics Transcription
Transcribe audio to timestamped lyrics using OpenAI Whisper or ElevenLabs Scribe API.
⚡ GPT-Optimized
)}
★ 3.8 (274)
↓ 17,648
v1.0.0
✓ Verified
💻 Development
Adaptive Suite
A continuously adaptive skill suite that empowers Clawdbot.
🧠 Claude-Ready
)}
★ 4.7 (88)
↓ 1,625
v1.0.0