Docker for multiple TTS Engines with a GRadio interface
-
Updated
Aug 29, 2024 - Jupyter Notebook
Docker for multiple TTS Engines with a GRadio interface
Voice-controlled robotic assistant with natural language processing, command validation, and speech synthesis. Built with a microservices architecture.
This case study uses Multimodal Generative AI (text, image, audio, video) to create a complete, professional digital marketing campaign for the small bakery, demonstrating a cost-effective content creation process.
Generate podcast-style audio locally with multiple free TTS engines. Supports edge-tts, Bark and Parler-TTS. Inspired by Google NotebookLM.
This repository serves as the official open-source evaluation hub for a premium, high-fidelity Conversational Female Monologue Dataset. This data addresses the critical shortage of natural human velocity, spontaneous breath placement, and unscripted vocal cadence in traditional training corpora.
Open source multilingual voice OS — 22 Indian languages via AI4Bharat (IndicTrans2, IndicConformer, Parler-TTS, IndicXlit) + faster-whisper. Jarvis-style desktop assistant.
Fine-tuned Parler-TTS (600M) for Hinglish language, Indian accent, and emotion-conditioned speech synthesis. Published at arXiv:2506.16310.
Add a description, image, and links to the parler-tts topic page so that developers can more easily learn about it.
To associate your repository with the parler-tts topic, visit your repo's landing page and select "manage topics."