A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
Updated Feb 18, 2026 - Python
ComfyUI custom node for VibeVoice TTS: expressive, long-form, multi-speaker conversational audio
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
AI-powered multi-voice audiobook generator — LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.
EDUMCP is a protocol that integrates the Model Context Protocol (MCP) with applications in the education field, dedicated to achieving seamless interconnection and interoperability among different AI models, educational applications, smart hardware, and teaching AGENTs.
Beautiful voice app: record or upload to train a voice, generate speech from text or files, save & download voices.
Official AllVoiceLab Model Context Protocol (MCP) server, supporting interaction with powerful text-to-speech and video translation APIs.
✨ NovelAI API Python SDK: easy to use, modern, and user-friendly.
ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text
OpenAI TTS-compatible Ukrainian TTS pipeline (StyleTTS2)
AI generates conversational podcast for ANY research paper, vividly!
A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
Local, portable GUI for Qwen3-TTS. Optimized for NVIDIA RTX 50 Series (CUDA 12.8). One-click install.
Fast, local, OpenAI-compatible TTS server with voice cloning support powered by Kyutai's Pocket TTS
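EasyTTS is a convenient tool for calling OpenAI's text-to-speech (TTS) functionality through third-party API services. It lets users enter text and choose among different models, voices, and formats to generate audio files.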
Production-ready RunPod serverless endpoint for Kokoro TTS. Features high-quality text-to-speech, voice mixing, word-level timestamps, and phoneme generation. Optimized for fast cold starts and auto-scaling.
AI-powered podcast generator with fast parallel research and natural conversational
High-performance KittenTTS API server with a built-in web UI, OpenAI-compatible routes, long-form text support, and optional CUDA acceleration.
Tool for scraping Reddit posts and their comments, adding music and voiceovers, creating shorts, and automatically uploading them to YouTube
An experimental TTS workspace for expressive Chinese voice generation, cinematic narration, and AI audio prototyping.