An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,805 217 Updated Apr 30, 2025

Zyphra / Zonos

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,637 729 Updated Mar 5, 2025

zhenye234 / LLaSA_training

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 564 39 Updated Apr 8, 2025

hexgrad / kokoro

https://hf.co/hexgrad/Kokoro-82M

JavaScript 2,919 315 Updated May 3, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 9,501 996 Updated Apr 9, 2025

mbzuai-oryx / LLMVoX

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Python 252 30 Updated May 16, 2025

Lynkes / F5TTS-FastInference

Python 8 Updated May 5, 2025

SesameAILabs / csm

A Conversational Speech Generation Model

Python 13,322 1,266 Updated Mar 27, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 4,816 387 Updated May 6, 2025

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 32,342 3,343 Updated Apr 19, 2025

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 2,508 289 Updated Apr 20, 2025

dbccccccc / ttsfm

TTSFM is a reverse-engineered API server that mirrors OpenAI's TTS service, providing a compatible interface for text-to-speech conversion with multiple voice options.

HTML 425 73 Updated Apr 9, 2025

Ryuk17 / SpeechAlgorithms

You can find the speech algorithms you want here

C 803 249 Updated Jan 1, 2025

bytedance / MegaTTS3

Python 5,354 378 Updated May 11, 2025

FireRedTeam / FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 992 77 Updated Mar 27, 2025

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 1,870 168 Updated May 21, 2025

lucasjinreal / Kokoros

🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.

Rust 514 53 Updated May 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cwr250

Block or report cwr250

tts

huggingface / parler-tts

FunAudioLLM / CosyVoice

fishaudio / fish-speech

SYSTRAN / faster-whisper

netease-youdao / EmotiVoice

myshell-ai / MeloTTS

shivammehta25 / Matcha-TTS

p0p4k / Matcha-TTS-2

coqui-ai / TTS

idiap / coqui-ai-TTS

edwko / OuteTTS

facebookresearch / spiritlm

Z-yq / TensorflowTTS

modelscope / ClearerVoice-Studio