Skip to content
View cwr250's full-sized avatar

Block or report cwr250

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

tts

17 repositories

Inference and training library for high-quality TTS models.

Python 5,095 536 Updated Dec 10, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,557 1,148 Updated Mar 7, 2025

SOTA Open Source TTS

Python 19,768 1,527 Updated Mar 3, 2025

Faster Whisper transcription with CTranslate2

Python 14,571 1,227 Updated Jan 1, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,717 660 Updated Aug 13, 2024

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 5,704 762 Updated Dec 24, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 903 117 Updated Mar 3, 2025

E2E TTS using Conditional Flow Matching (Experimental*)

Jupyter Notebook 69 5 Updated Nov 10, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 38,212 4,786 Updated Aug 16, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 1,065 117 Updated Feb 26, 2025

Interface for OuteTTS models.

Python 943 82 Updated Feb 14, 2025

Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

Python 886 57 Updated Oct 28, 2024

超快的中文普通话TTS

Python 118 23 Updated Apr 2, 2021

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,351 176 Updated Feb 14, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 5,884 605 Updated Mar 5, 2025

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 436 34 Updated Feb 14, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 1,456 145 Updated Mar 1, 2025