Lists (1)
Sort Name ascending (A-Z)
Stars
Official Repository of **CaPa**: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation
Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
4DS (Scaling 4D Representations) BetterDepth ChronoDepth Depth Any Video Depth Anything Depth Pro DepthCrafter DINOv2 FutureDepth GBDMF GenPercept GeoWizard LeReS LightedDepth Marigold Metric3D MiD…
DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling
A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Official repository for "Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders"
Synthetic animal image dataset for pose and shape reconstruction.
Illumination Drawing Tools for Text-to-Image Diffusion Models
Diffusers reimplementation for https://rf-inversion.github.io/
“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with any VAE.
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation
Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".
Official repo for "IDArb: Intrinsic Decomposition for arbitrary number of input views and illuminations"
Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"
[NeurIPS 2024] L4GM: Large 4D Gaussian Reconstruction Model
Zero-Shot Monocular Depth Completion with Guided Diffusion
🍳 [arXiv'24] PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
[NeurIPS'24] Official implementation of "HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors"
Code for "StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models", Arxiv 2024
[NeurIPS 2024] Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication
Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models"