Skip to content
View cwr250's full-sized avatar

Block or report cwr250

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

tts

32 repositories

Inference and training library for high-quality TTS models.

Python 5,259 556 Updated Dec 10, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 13,982 1,450 Updated May 22, 2025

SOTA Open Source TTS

Python 21,179 1,695 Updated Apr 12, 2025

Faster Whisper transcription with CTranslate2

Python 16,148 1,333 Updated Apr 29, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,989 688 Updated Aug 13, 2024

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 6,084 837 Updated Dec 24, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 1,003 132 Updated May 5, 2025

E2E TTS using Conditional Flow Matching (Experimental*)

Jupyter Notebook 69 5 Updated Nov 10, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 40,193 5,152 Updated Aug 16, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 1,380 164 Updated May 22, 2025

Interface for OuteTTS models.

Python 1,275 106 Updated May 20, 2025

Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

Python 904 59 Updated Oct 28, 2024

超快的中文普通话TTS

Python 119 23 Updated Apr 2, 2021

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,805 217 Updated Apr 30, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,637 729 Updated Mar 5, 2025

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 564 39 Updated Apr 8, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 2,919 315 Updated May 3, 2025

Spark-TTS Inference Code

Python 9,501 996 Updated Apr 9, 2025

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Python 252 30 Updated May 16, 2025
Python 8 Updated May 5, 2025

A Conversational Speech Generation Model

Python 13,322 1,266 Updated Mar 27, 2025

Towards Human-Sounding Speech

Python 4,816 387 Updated May 6, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 32,342 3,343 Updated Apr 19, 2025

zero-shot voice conversion & singing voice conversion, with real-time support

Python 2,508 289 Updated Apr 20, 2025

TTSFM is a reverse-engineered API server that mirrors OpenAI's TTS service, providing a compatible interface for text-to-speech conversion with multiple voice options.

HTML 425 73 Updated Apr 9, 2025

You can find the speech algorithms you want here

C 803 249 Updated Jan 1, 2025
Python 5,354 378 Updated May 11, 2025

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 992 77 Updated Mar 27, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 1,870 168 Updated May 21, 2025

🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.

Rust 514 53 Updated May 7, 2025