A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
-
Updated
Oct 2, 2025 - Python
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
EDUMCP is a protocol that integrates the Model Context Protocol (MCP) with applications in the education field, dedicated to achieving seamless interconnection and interoperability among different AI models, educational applications, smart hardware, and teaching AGENTs.
Official AllVoiceLab Model Context Protocol (MCP) server, supporting interaction with powerful text-to-speech and video translation APIs.
Beautiful voice app: record or upload to train a voice, generate speech from text or files, save & download voices.
✨ NovelAI api python sdk, easy to use, modern and user-friendly.
ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text
OpenAI TTS Compatible Ukrainian TTS StyleTTS2 Pipeline
AI generates conversational podcast for ANY research paper, vividly!
A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
EasyTTS是一个便捷的工具,旨在方便地使用第三方API服务来调用OpenAI的文本转语音(TTS)功能。 EasyTTS允许用户输入文本,并选择不同的模型、音色、格式来生成音频文件。
Production-ready RunPod serverless endpoint for Kokoro TTS. Features high-quality text-to-speech, voice mixing, word-level timestamps, and phoneme generation. Optimized for fast cold starts and auto-scaling.
AI-powered podcast generator with fast parallel research and natural conversational
A raspberryPi magic mirror based on facial recognition
Personal surveillance system using screenshots, webcam and OpenAI GPT-4o to check if the user is focused on his tasks. If not, the user will be roasted by GLaDOS voice and character to regain focus again.
Tool for scraping posts and corresponding comments from reddit, adding music and voiceovers, creating the shorts and automatically uploading to Youtube
🎙️ Enhance speech generation and voice cloning using ComfyUI with the VoxCPM integration for token-free, context-aware TTS.
create audio books from pdfs with one click , available on windows , linux, mac
Open-source FastAPI wrapper for F5-TTS. A powerful Text-to-Speech API with real-time voice cloning and streaming support.
Add a description, image, and links to the voice-generation topic page so that developers can more easily learn about it.
To associate your repository with the voice-generation topic, visit your repo's landing page and select "manage topics."