Lists (9)
Sort Name ascending (A-Z)
Stars
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Simple code for generating a color-coded latex table from raw data
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
[NeurIPS'24] Official implementation of "HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors"
PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space
A PyTorch native library for large model training
Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page https://mv-dust3rp.github.io/
Toward Interactive Regional Understanding in Vision-Large Language Models (NAACL 2024)
[3DV 2025 Oral]: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
[NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D
[2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
[NeurIPS 2024 Spotlight] Tetrahedron Splatting for 3D Generation
[NIPS'24] Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection
[NeurIPS 2024] OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding
😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D
Implementation for ECCV 2022 paper Language-Grounded Indoor 3D Semantic Segmentation in the Wild
Open-Sora: Democratizing Efficient Video Production for All
SplatFormer: Point Transformer for Robust 3D Gaussian Splatting
DreamUV - 3D viewport UV editing tools for Blender
[AAAI' 25] Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
Code for "EnvGS: Modeling View-Dependent Appearance with Environment Gaussian", arXiv 2024. Including a fully differentiable 2D Gaussian ray tracer built on 2DGS and OptiX, supporting multiple-boun…
An official implementation of RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videos