✓ Verified ✍️ Content Creation ✓ Enhanced Data

Mlx Audio Server

A fast, accurate, and fully local OpenAI-compatible API.

Rating
4.6 (181 reviews)
Downloads
7,003 downloads
Version
1.0.0

Overview

A fast, accurate, and fully local OpenAI-compatible API.

Complete Documentation

View Source →

MLX Audio Server

Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.

mlx-audio: The best audio processing library built on Apple's MLX framework, providing fast and efficient text-to-speech (TTS), speech-to-text (STT), and speech-to-speech (STS) on Apple Silicon.

guoqiao/tap/mlx-audio-server: Homebrew Formula to install mlx-audio with brew, and run mlx_audio.server as a LaunchAgent service on macOS.

Requirements

  • mlx: macOS with Apple Silicon
  • brew: used to install deps if not available

Installation

bash
bash ${baseDir}/install.sh
This script will:
  • install ffmpeg/jq with brew if missing.
  • install homebrew formula mlx-audio-server from guoqiao/tap
  • start brew service for mlx-audio-server

Usage

STT/Speech-To-Text(default model: mlx-community/glm-asr-nano-2512-8bit):

bash
# input will be converted to wav with ffmpeg, if not yet.
# output will be transcript text only.
bash ${baseDir}/run_stt.sh <audio_or_video_path>

TTS/Text-To-Speech(default model: mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-bf16):

bash
# audio will be saved into a tmp dir, with default name `speech.wav`, and print to stdout.
bash ${baseDir}/run_tts.sh "Hello, Human!"
# or you can specify a output dir
bash ${baseDir}/run_tts.sh "Hello, Human!" ./output
# output will be audio path only.
You can use both scripts directly, or as example/reference.

Installation

Terminal bash

openclaw install mlx-audio-server
    
Copied!

💻Code Examples

bash ${baseDir}/install.sh

bash-basedirinstallsh.txt
This script will:
- install ffmpeg/jq with brew if missing.
- install homebrew formula `mlx-audio-server` from `guoqiao/tap`
- start brew service for `mlx-audio-server`

## Usage

STT/Speech-To-Text(default model: **mlx-community/glm-asr-nano-2512-8bit**):
example.sh
# input will be converted to wav with ffmpeg, if not yet.
# output will be transcript text only.
bash ${baseDir}/run_stt.sh <audio_or_video_path>
example.sh
# audio will be saved into a tmp dir, with default name `speech.wav`, and print to stdout.
bash ${baseDir}/run_tts.sh "Hello, Human!"
# or you can specify a output dir
bash ${baseDir}/run_tts.sh "Hello, Human!" ./output
# output will be audio path only.

Tags

#media_and-streaming #api

Quick Info

Category Content Creation
Model GPT-4
Complexity One-Click
Author guoqiao
Last Updated 3/10/2026
🚀
Optimized for
GPT-4

Ready to Install?

Get started with this skill in seconds

openclaw install mlx-audio-server