VoiceGenAI is a modern Text-to-Speech (TTS) Generative AI application built using Hugging Face models and Streamlit, capable of converting text into natural-sounding human speech.
✅ Text-to-Speech using state-of-the-art Hugging Face models
✅ Streamlit-based interactive UI
✅ Play & download generated audio
✅ Supports Hugging Face Inference API
✅ Optional local offline TTS (Coqui TTS)
✅ Clean, modular, production-ready Python code
✅ Docker-ready for deployment
| Model | Description |
|---|---|
facebook/mms-tts-eng |
High-quality multilingual TTS |
espnet/kan-bayashi-ljspeech |
Research-grade neural TTS |
Coqui TTS (optional) |
Offline/local speech synthesis |
- Python 3.9+
- Streamlit
- Hugging Face Transformers / Inference API
- PyTorch
- Coqui TTS (optional)
- Docker
VoiceGenAI/
├── app.py
├── tts_utils.py
├── requirements.txt
├── .env.example
├── Dockerfile
├── README.md
└── samples/
git clone https://github.com/ramarav/VoiceGenAI.git
cd VoiceGenAIpython -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activatepip install -r requirements.txt- Create token: https://huggingface.co/settings/tokens
- Create
.envfile:
HF_TOKEN=hf_your_token_herestreamlit run app.pyOpen browser at 👉 http://localhost:8501
🎧 Text:
Welcome to VoiceGenAI, your personal text to speech assistant.
🔊 Output:
- Natural human-like speech
- Downloadable
.wavfile
docker build -t voicegenai .
docker run -p 8501:8501 --env HF_TOKEN=hf_xxx voicegenai- Accessibility tools
- Voice assistants
- AI narration
- Content creation
- Chatbots with voice
- EdTech / E-learning platforms
- Multi-voice selection
- SSML support
- Batch TTS (CSV upload)
- Language auto-detection
- Cloud deployment (AWS/GCP)
- REST API version
Contributions are welcome!
Fork the repo, create a feature branch, and submit a PR 🚀
If you like this project:
- ⭐ Star the repository
- 🍴 Fork it
- 🧑💻 Share it with the community
This project is licensed under the MIT License.
#GenerativeAI #TextToSpeech #HuggingFace #Streamlit #Python
#AIProjects #OpenSource #MachineLearning #DeepLearning #VoiceAI
Mekala Ramarao
Python Developer | AI/ML Engineer
GPU & AI Systems | Open Source Contributor
🔥 Built for engineers and AI enthusiasts.