🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Updated Aug 24, 2025 - Jupyter Notebook
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.
AI voice cloning desktop application that runs locally on your computer and is free to use
Run XTTS within Docker/Podman for voice fine-tuning in Gradio's Web UI