Stars
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion
Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”
The most customisable and low-latency cross platform/shell prompt renderer
[NeurIPS 2024] Official implementation of "Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting"
[NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation
Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.
MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed
Official Code Release for SIGGRAPH Asia 2024 Paper: GS^3: Efficient Relighting with Triple Gaussian Splatting
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
Huggingface cloth segmentation using U2NET
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
[CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
Official implementation for the SIGGRAPH Asia 2024 paper SPARK: Self-supervised Personalized Real-time Monocular Face Capture
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
Implementation of Diffusion Transformer Model in Pytorch
[CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
[ECCV 2024] SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction
[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
(SIGGRAPH Asia 2024) This is the official PyTorch implementation of SIGGRAPH Asia 2024 paper: DrawingSpinUp: 3D Animation from Single Character Drawings
Official Code for ICCV 2021 paper "Towards Flexible Blind JPEG Artifacts Removal (FBCNN)"
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
YuliangXiu / StableNormal-1
Forked from Stable-X/StableNormal[SIGGRAPH Asia 2024 & TOG] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal