Stars
[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…
[ICLR 2024] Generalizable and Precise Head Avatar from Image(s)
Metrical Monocular Photometric Tracker [ECCV2022]
This is the official repository for the paper "Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On". CVPR 2024
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
[CSUR] A Survey on Video Diffusion Models
collection of diffusion model papers categorized by their subareas
Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
This repository contains the implementations of the preprocessing stages of VITON-HD
Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
A collection of resources on controllable generation with text-to-image diffusion models.
Let us democratise high-resolution generation! (CVPR 2024)
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
Digital Human Resource Collection: 2D/3D/4D human modeling, avatar generation & animation, clothed people digitalization, virtual try-on, and others.
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
[ICCV 2023] Consistent Image Synthesis and Editing
CLIP+MLP Aesthetic Score Predictor
animatediff prompt travel
📚 A collection of Deep Learning based Image Colorization and Video Colorization papers.
[NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjects
Kandinsky 2 — multilingual text2image latent diffusion model
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation