🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
Updated
Aug 16, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
zero-shot realtime TTS system, fully offline, free and open source
Persian text-to-speech streamlit interface
🎙️ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format — Python package for offline speech synthesis 🚀📦
Train voice styles for Supertone/supertonic-3 model.
Transform any video into a professional multilingual production with natural voice cloning, lip-sync, and on-screen text translation. No cloud APIs, no subscriptions, no data leaving your machine.
😻 A simple ComfyUI custom node for KittenTTS - an ultra-lightweight text-to-speech model. Works on CUDA and CPU.
Contains voice models based on the GPT-SoVITS architecture of different characters including Hitori Gotoh, Ikuyo Kiya and Ichiji Nijika trained from voices from the anime "Bocchi the Rock!".
Voice Agent responds like humans for the sales teams to qualify the leads and different use cases
🎤 Enhance multilingual communication with T5Gemma-TTS, a versatile Text-to-Speech model supporting easy training and inference.
🐱 Kitten TTS Studio using local onnx models TTS Offline
XTTS fine-tuning via CLI
A Streamlit web app for AI-powered voice cloning using Coqui XTTS v2. Record or upload reference voices, clone speech in multiple languages, and generate natural audio outputs.
Audiobook Simplifier is a tool that creates audiobooks from text documents or eBooks using TTS (Text-to-Speech) technology.
Generate AI-voiced short videos with synced subtitles using Python, ElevenLabs TTS, and FFmpeg for clear, automated social media and educational content.
🗣️ Enable text-to-speech with Qwen TTS, a simple API solution that seamlessly integrates into your applications using Docker and Home Assistant.
This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for accelerated training.
Add a description, image, and links to the tts-model topic page so that developers can more easily learn about it.
To associate your repository with the tts-model topic, visit your repo's landing page and select "manage topics."