Stars
[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
FaceLift: Learning Generalizable Single Image 3D Face Reconstruction from Synthetic Heads
Official Implementation of "PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting"
FreeVS: Generative View Synthesis on Free Driving Trajectory
The official implementation of Tensor ProducT ATTenTion Transformer (T6)
This repository will host the code for the SIGGRAPH Asia 2024 Paper titled: "GaussianHeads: End-to-End Learning of Drivable Gaussian Head Avatars from Coarse-to-fine Representations"
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
[ICLR'25 Oral] No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces
[IJCAI'24] Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer
A paper list of some recent works about Token Compress for Vit and VLM
Implementation of Alphafold 3 from Google Deepmind in Pytorch
Image forgery recognition algorithm
[InterSpeech 2024] Official code repository of paper titled "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Classification" accepted in InterSpeech 2024 conference.
This is official implementation of the paper: "iHuman: Instant Animatable Digital Humans From Monocular Videos" [ECCV 2024]
PyTorch implementation of the ICCV paper "GRAM-HD: 3D-Consistent Image Generation at High Resolution with Generative Radiance Manifolds"
The official implementation of Distribution Backtracking Distillation for One-step Diffusion Models
To support and further the research in the field of portrait animation , we are excited to launch PhotoPoster, an open project for pose-driven image generation.
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset..