Stars
A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5…
Create shapes that follow a spline path. Import background image, edit splines, and export for use in VACE.
Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation
LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨ (ICCV 2025 Highlight)
SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025)
deepbeepmeep / Wan2GP
Forked from Wan-Video/Wan2.1A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.
Enable AI models for video production in the browser
Two conversational AI agents switching from English to sound-level protocol after confirming they are both AI agents
Pippo: High-Resolution Multi-View Humans from a Single Image
🔎 Utility for visualizing the stencil buffer in Unity
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
Unity component offering functionality similiar to LineRenderer but in 3D
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
Nodes for image juxtaposition for Flux in ComfyUI
CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With the rising demand for customized video generation, this hub …
Select a portrait, click to move the head around (please use your own space / GPU!)
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Real-time latent exploration of diffusion models
A Unity package to run pretrained diffusion models with Unity Sentis
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing.
ONNX-compatible Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data



