OpenAI-compatible API servers for Qwen3-TTS and Qwen3-ASR, enabling self-hosted text-to-speech and speech-to-text via the standard OpenAI audio endpoints:
- `/v1/audio/speech` — Text-to-speech (TTS)
- `/v1/audio/transcriptions` — Speech-to-text (ASR)
| Language | Directory | Status |
|---|---|---|
| Python | python/ | Available |
| Rust | rust/ (planned) | Coming soon |
The Python implementation is a FastAPI server built on the qwen-tts and qwen-asr Python packages. See python/README.md for setup, Docker images, API reference, and usage examples.
The Rust implementation will be built on the qwen3_tts Rust crate. Stay tuned.
- Text-to-Speech (TTS): Generate natural speech from text using Qwen3-TTS models
  - Multiple voice presets (Vivian, Ryan, Serena, etc.)
  - Voice cloning from audio samples
  - Multiple languages (English, Chinese, Japanese, Korean, and more)
  - Multiple output formats (WAV, MP3, FLAC, Opus, AAC)
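Because the server speaks the standard OpenAI audio API, a speech request can be built with nothing but the Python standard library. The base URL and model name below are assumptions for a local deployment, not values documented by this repo; check python/README.md for the real ones.

```python
import json
import urllib.request

# Hypothetical local deployment; adjust host/port to match your server.
BASE_URL = "http://localhost:8000"

def build_speech_request(text, voice="Vivian", response_format="wav"):
    """Build an OpenAI-style POST to /v1/audio/speech (not sent here)."""
    payload = json.dumps({
        "model": "qwen3-tts",  # assumed model name; see python/README.md
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/v1/audio/speech",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_speech_request("Hello from a self-hosted TTS server")
# urllib.request.urlopen(req).read() would return the encoded audio bytes
```

The same request shape works with the official OpenAI client libraries by pointing their base URL at this server.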
- Speech-to-Text (ASR): Transcribe audio to text using Qwen3-ASR models
  - Support for 30+ languages with auto-detection
  - Accepts various audio formats (WAV, MP3, M4A, etc.)
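The transcription endpoint takes a multipart file upload, per the OpenAI audio API. As a sketch (again with an assumed local base URL and model name), the request can be assembled by hand:

```python
import io
import urllib.request
import uuid

# Hypothetical local deployment; adjust host/port to match your server.
BASE_URL = "http://localhost:8000"

def build_transcription_request(audio_bytes, filename="clip.wav",
                                model="qwen3-asr"):
    """Build an OpenAI-style multipart POST to /v1/audio/transcriptions."""
    boundary = uuid.uuid4().hex
    body = io.BytesIO()
    # Plain form field carrying the model name (assumed; see server docs).
    body.write((f'--{boundary}\r\n'
                f'Content-Disposition: form-data; name="model"\r\n\r\n'
                f'{model}\r\n').encode("utf-8"))
    # File part carrying the raw audio bytes.
    body.write((f'--{boundary}\r\n'
                f'Content-Disposition: form-data; name="file"; '
                f'filename="{filename}"\r\n'
                f'Content-Type: audio/wav\r\n\r\n').encode("utf-8"))
    body.write(audio_bytes)
    body.write(f"\r\n--{boundary}--\r\n".encode("utf-8"))
    return urllib.request.Request(
        f"{BASE_URL}/v1/audio/transcriptions",
        data=body.getvalue(),
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )

req = build_transcription_request(b"RIFF...")  # placeholder, not real audio
# urllib.request.urlopen(req) would return JSON containing the transcript
```

In practice an HTTP client that handles multipart encoding (or an OpenAI SDK) does this for you; the sketch just shows the wire format the server accepts.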
The purpose of these API servers is to provide self-hosted, free backend audio services for projects such as:
- Clawdbot — AI agent
- EchoKit — Voice AI device
- Olares — Personal AI cloud OS
- GaiaNet — Incentivized AI agent network and marketplace
Any application that speaks the OpenAI audio API can swap in this server as a drop-in replacement.
See LICENSE.