✓ Verified 💻 Development ✓ Enhanced Data

Comfyui Tts

Generate speech audio using ComfyUI Qwen-TTS service.

Rating
4.1 (158 reviews)
Downloads
2,912 downloads
Version
1.0.0

Overview

Generate speech audio using ComfyUI Qwen-TTS service.

Complete Documentation

View Source →

ComfyUI TTS Skill

Generate speech audio using ComfyUI's Qwen-TTS service. This skill allows you to convert text to speech through ComfyUI's API.

Configuration

Environment Variables

Set these environment variables to configure the ComfyUI connection:

bash
export COMFYUI_HOST="localhost"      # ComfyUI server host
export COMFYUI_PORT="8188"           # ComfyUI server port
export COMFYUI_OUTPUT_DIR=""         # Optional: Custom output directory

Usage

Basic Text-to-Speech

Generate audio from text using default settings:

bash
scripts/tts.sh "你好,世界"

Advanced Options

Customize voice characteristics:

bash
# Specify character and style
scripts/tts.sh "你好" --character "Girl" --style "Emotional"

# Change model size
scripts/tts.sh "你好" --model "3B"

# Specify output file
scripts/tts.sh "你好" --output "/path/to/output.wav"

# Combine options
scripts/tts.sh "你好,这是测试" \
  --character "Girl" \
  --style "Emotional" \
  --model "1.7B" \
  --output "~/audio/test.wav"

Available Options

OptionDescriptionDefault
--characterVoice character (Girl/Boy/etc.)"Girl"
--styleSpeaking style (Emotional/Neutral/etc.)"Emotional"
--modelModel size (0.5B/1.7B/3B)"1.7B"
--outputOutput file pathAuto-generated
--temperatureGeneration temperature (0-1)0.9
--top-pTop-p sampling0.9
--top-kTop-k sampling50

Workflow

The skill performs these steps:

  • Construct Workflow: Builds a ComfyUI workflow JSON with your text and settings
  • Submit Job: Sends the workflow to ComfyUI's /prompt endpoint
  • Poll Status: Monitors job completion via /history endpoint
  • Retrieve Audio: Returns the path to the generated audio file

Troubleshooting

Connection Refused

  • Verify ComfyUI is running: curl http://$COMFYUI_HOST:$COMFYUI_PORT/system_stats
  • Check host and port settings

Job Timeout

  • Large models (3B) take longer to generate
  • Try smaller models (0.5B, 1.7B) for faster results

Output Not Found

  • Check ComfyUI's output directory configuration
  • Verify file permissions

API Reference

The skill uses ComfyUI's native API endpoints:

  • POST /prompt - Submit workflow
  • GET /history - Check job status
  • Output files are saved to ComfyUI's configured output directory

Installation

Terminal bash

openclaw install comfyui-tts
    
Copied!

💻Code Examples

export COMFYUI_OUTPUT_DIR="" # Optional: Custom output directory

export-comfyuioutputdir--optional-custom-output-directory.txt
## Usage

### Basic Text-to-Speech

Generate audio from text using default settings:

scripts/tts.sh "你好,世界"

scriptsttssh-.txt
### Advanced Options

Customize voice characteristics:
example.sh
export COMFYUI_HOST="localhost"      # ComfyUI server host
export COMFYUI_PORT="8188"           # ComfyUI server port
export COMFYUI_OUTPUT_DIR=""         # Optional: Custom output directory
example.sh
# Specify character and style
scripts/tts.sh "你好" --character "Girl" --style "Emotional"

# Change model size
scripts/tts.sh "你好" --model "3B"

# Specify output file
scripts/tts.sh "你好" --output "/path/to/output.wav"

# Combine options
scripts/tts.sh "你好,这是测试" \
  --character "Girl" \
  --style "Emotional" \
  --model "1.7B" \
  --output "~/audio/test.wav"

Tags

#git_and-github

Quick Info

Category Development
Model Claude 3.5
Complexity One-Click
Author yhsi5358
Last Updated 3/10/2026
🚀
Optimized for
Claude 3.5
🧠

Ready to Install?

Get started with this skill in seconds

openclaw install comfyui-tts