✓ Verified ✍️ Content Creation ✓ Enhanced Data

Chichi Speech

A RESTful service for high-quality text-to-speech using Qwen3.

Rating
3.9 (111 reviews)
Downloads
18,894 downloads
Version
1.0.0

Overview

A RESTful service for high-quality text-to-speech using Qwen3.

Complete Documentation

View Source →

Chichi Speech Service

This skill provides a FastAPI-based REST service for Qwen3 TTS, specifically configured for reusing a high-quality reference audio prompt for efficient and consistent voice cloning. This service is packaged as an installable CLI.

Installation

Prerequisites: python >= 3.10.

bash
pip install -e .

Usage

1. Start the Service

The service runs on port 9090 by default.

bash
# Start the server (runs in foreground, use & for background or a separate terminal)
# Optional: Uudate to your own reference audio and text for voice cloning
chichi-speech --port 9090 --host 127.0.0.1 --ref-audio "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-TTS-Repo/clone_2.wav" --ref-text "Okay. Yeah. I resent you. I love you. I respect you. But you know what? You blew it! And thanks to you."

2. Verify Service is Running

Check the health/docs:
bash
curl http://localhost:9090/docs

3. Generate Speech

Use cURL:

bash
curl -X POST "http://localhost:9090/synthesize" \
     -H "Content-Type: application/json" \
     -d '{
           "text": "Nice to meet you",
           "language": "English"
         }' \
     --output output/nice_to_meet.wav

Functionality

  • Endpoint: POST /synthesize
  • Default Port: 9090
  • Voice Cloning: Uses a pre-computed voice prompt from reference files to ensure the cloned voice is consistent and generation is fast.

Requirements

  • Python 3.10+
  • qwen-tts (Qwen3 model library)
  • Access to a reference audio file for voice cloning.
  • By default, it uses public sample audio from Qwen3.
  • CRITICAL: You can provide your own reference audio using the --ref-audio and --ref-text flags.

Installation

Terminal bash

openclaw install chichi-speech
    
Copied!

💻Code Examples

pip install -e .

pip-install--e-.txt
## Usage

### 1. Start the Service

The service runs on port **9090** by default.

chichi-speech --port 9090 --host 127.0.0.1 --ref-audio "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-TTS-Repo/clone_2.wav" --ref-text "Okay. Yeah. I resent you. I love you. I respect you. But you know what? You blew it! And thanks to you."

chichi-speech---port-9090---host-127001---ref-audio-httpsqianwen-resoss-cn-beijingaliyuncscomqwen3-tts-repoclone2wav---ref-text-okay-yeah-i-resent-you-i-love-you-i-respect-you-but-you-know-what-you-blew-it-and-thanks-to-you.txt
### 2. Verify Service is Running
Check the health/docs:

curl http://localhost:9090/docs

curl-httplocalhost9090docs.txt
### 3. Generate Speech

Use cURL:
example.sh
# Start the server (runs in foreground, use & for background or a separate terminal)
# Optional: Uudate to your own reference audio and text for voice cloning
chichi-speech --port 9090 --host 127.0.0.1 --ref-audio "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-TTS-Repo/clone_2.wav" --ref-text "Okay. Yeah. I resent you. I love you. I respect you. But you know what? You blew it! And thanks to you."
example.sh
curl -X POST "http://localhost:9090/synthesize" \
     -H "Content-Type: application/json" \
     -d '{
           "text": "Nice to meet you",
           "language": "English"
         }' \
     --output output/nice_to_meet.wav

Tags

#speech_and-transcription

Quick Info

Category Content Creation
Model Claude 3.5
Complexity One-Click
Author hudeven
Last Updated 3/10/2026
🚀
Optimized for
Claude 3.5
🧠

Ready to Install?

Get started with this skill in seconds

openclaw install chichi-speech