✓ Verified
💻 Development
✓ Enhanced Data
Vibevoice
Local Spanish TTS using Microsoft VibeVoice.
- Rating
- 4.6 (210 reviews)
- Downloads
- 6,246 downloads
- Version
- 1.0.0
Overview
Local Spanish TTS using Microsoft VibeVoice.
Complete Documentation
View Source →
VibeVoice TTS
Local text-to-speech using Microsoft's VibeVoice model. Generates natural Spanish voice audio, perfect for WhatsApp voice messages.
Quick Start
bash
# Basic usage
{baseDir}/scripts/vv.sh "Hola, esto es una prueba" -o /tmp/audio.ogg
# From file
{baseDir}/scripts/vv.sh -f texto.txt -o /tmp/audio.ogg
# Different voice
{baseDir}/scripts/vv.sh "Texto" -v en-Wayne -o /tmp/audio.ogg
# Adjust speed (0.5-2.0)
{baseDir}/scripts/vv.sh "Texto" -s 1.2 -o /tmp/audio.ogg
Configuration
| Setting | Default | Description |
|---|---|---|
| Voice | sp-Spk1_man | Spanish male voice (slight Mexican accent) |
| Speed | 1.15 | 15% faster than normal |
| Format | .ogg | Opus codec for WhatsApp |
Available Voices
Spanish:
sp-Spk1_man- Male, slight Mexican accent (default)
en-Wayne- Maleen-Denise- Female- Other voices in
~/VibeVoice/demo/voices/streaming_model/
Output Formats
.ogg- Opus codec (WhatsApp compatible, recommended).mp3- MP3 format.wav- Uncompressed WAV
For WhatsApp
Always use .ogg format with asVoice=true in the message tool:
bash
# Generate
{baseDir}/scripts/vv.sh "Tu mensaje aquí" -o /tmp/mensaje.ogg
# Send via message tool
message action=send channel=whatsapp to="+34XXXXXXXXX" filePath=/tmp/mensaje.ogg asVoice=true
Requirements
- GPU: NVIDIA with ~2GB VRAM
- VibeVoice: Installed at
~/VibeVoice - ffmpeg: For audio conversion
- Python 3.10+: With torch, torchaudio
Performance
- RTF: ~0.24x (generates faster than realtime)
- 1 minute of audio ≈ 15 seconds to generate
Notes
- First run loads model (~10s), subsequent runs are faster
- Audio rule: Only send voice if user requests it or speaks via audio
- Keep text under 1500 chars for best quality
Installation
Terminal bash
openclaw install vibevoice
Copied!
💻Code Examples
{baseDir}/scripts/vv.sh "Texto" -s 1.2 -o /tmp/audio.ogg
basedirscriptsvvsh-texto--s-12--o-tmpaudioogg.txt
## Configuration
| Setting | Default | Description |
|---------|---------|-------------|
| Voice | `sp-Spk1_man` | Spanish male voice (slight Mexican accent) |
| Speed | `1.15` | 15% faster than normal |
| Format | `.ogg` | Opus codec for WhatsApp |
## Available Voices
Spanish:
- `sp-Spk1_man` - Male, slight Mexican accent (default)
English:
- `en-Wayne` - Male
- `en-Denise` - Female
- Other voices in `~/VibeVoice/demo/voices/streaming_model/`
## Output Formats
- `.ogg` - Opus codec (WhatsApp compatible, recommended)
- `.mp3` - MP3 format
- `.wav` - Uncompressed WAV
## For WhatsApp
Always use `.ogg` format with `asVoice=true` in the message tool:example.sh
# Basic usage
{baseDir}/scripts/vv.sh "Hola, esto es una prueba" -o /tmp/audio.ogg
# From file
{baseDir}/scripts/vv.sh -f texto.txt -o /tmp/audio.ogg
# Different voice
{baseDir}/scripts/vv.sh "Texto" -v en-Wayne -o /tmp/audio.ogg
# Adjust speed (0.5-2.0)
{baseDir}/scripts/vv.sh "Texto" -s 1.2 -o /tmp/audio.oggexample.sh
# Generate
{baseDir}/scripts/vv.sh "Tu mensaje aquí" -o /tmp/mensaje.ogg
# Send via message tool
message action=send channel=whatsapp to="+34XXXXXXXXX" filePath=/tmp/mensaje.ogg asVoice=true⚙️Configuration Options
| Option | Type | Default | Description |
|---|---|---|---|
Voice | string | sp-Spk1_man | Spanish male voice (slight Mexican accent) |
Speed | string | 1.15 | 15% faster than normal |
Format | string | .ogg | Opus codec for WhatsApp |
Tags
#web_and-frontend-development
Quick Info
Category Development
Model Claude 3.5
Complexity One-Click
Author javier887
Last Updated 3/10/2026
🚀
Optimized for
Claude 3.5
Ready to Install?
Get started with this skill in seconds
openclaw install vibevoice
Related Skills
✓ Verified
💻 Development
4claw
4claw — a moderated imageboard for AI agents.
🧠 Claude-Ready
)}
★ 4.4 (118)
↓ 4,990
v1.0.0
✓ Verified
💻 Development
Aap Passport
Agent Attestation Protocol - The Reverse Turing Test.
🧠 Claude-Ready
)}
★ 4.3 (89)
↓ 4,621
v1.0.0
✓ Verified
💻 Development
Acestep Lyrics Transcription
Transcribe audio to timestamped lyrics using OpenAI Whisper or ElevenLabs Scribe API.
⚡ GPT-Optimized
)}
★ 3.8 (274)
↓ 17,648
v1.0.0
✓ Verified
💻 Development
Adaptive Suite
A continuously adaptive skill suite that empowers Clawdbot.
🧠 Claude-Ready
)}
★ 4.7 (88)
↓ 1,625
v1.0.0