Fine-tune SpeechT5 for non-English text-to-speech task, implemented in PyTorch.
-
Updated
May 28, 2024 - Python
Fine-tune SpeechT5 for non-English text-to-speech task, implemented in PyTorch.
A fine-tuned SpeechT5 Urdu TTS model with voice cloning that converts both Urdu and Roman Urdu text into natural speech. Trained on diverse Urdu and Zia Mohiuddin recordings, it offers expressive, speaker-specific synthesis with a FastAPI demo for easy testing.
Assignment 2: Fine-tuning Text-to-Speech (TTS) Models for English Technical Speech and Regional Languages
Add a description, image, and links to the speecht5-tts topic page so that developers can more easily learn about it.
To associate your repository with the speecht5-tts topic, visit your repo's landing page and select "manage topics."