OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding
All reported results were obtained on a single NVIDIA RTX 4090 GPU.
Abstract: Recent advancements in 3D Gaussian Splatting have significantly improved the efficiency and quality of dense semantic SLAM. However, previous methods are generally constrained by limited-category pre-trained classifiers and implicit semantic representations, which hinder their performance in open-set scenarios and restrict 3D object-level scene understanding. To address these issues, we propose OpenGS-SLAM, an innovative framework that utilizes 3D Gaussian representation to perform dense semantic SLAM in open-set environments. Our system integrates explicit semantic labels derived from 2D foundation models into the 3D Gaussian framework, facilitating robust 3D object-level scene understanding. We introduce Gaussian Voting Splatting to enable fast 2D label map rendering and scene updating. Additionally, we propose a Confidence-based 2D Label Consensus method to ensure consistent labeling across multiple views. Furthermore, we employ a Segmentation Counter Pruning strategy to improve the accuracy of the semantic scene representation. Extensive experiments on both synthetic and real-world datasets demonstrate the effectiveness of our method in scene understanding, tracking, and mapping, achieving 10× faster semantic rendering and 2× lower storage costs than existing methods.
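To give an intuition for Gaussian Voting (an illustrative sketch of the idea, not the paper's CUDA implementation): each Gaussian contributing to a pixel casts its alpha-blending weight as a vote for its discrete semantic label, and the rendered label map takes the per-pixel winner. All names below are hypothetical.

```python
import torch

def vote_label_map(pix, lab, w, num_labels, H, W):
    # Each contribution casts its blending weight `w` as a vote for
    # label `lab` at flattened pixel index `pix`.
    votes = torch.zeros(H * W, num_labels)
    votes.index_put_((pix, lab), w, accumulate=True)
    # The rendered label map is the per-pixel argmax over the ballots.
    return votes.argmax(dim=1).reshape(H, W)

# Toy example: three Gaussian contributions on a 2x2 image with 4 labels.
pix = torch.tensor([0, 0, 3])        # flattened pixel indices
lab = torch.tensor([1, 2, 1])        # discrete semantic labels
w   = torch.tensor([0.7, 0.3, 1.0])  # alpha-blending weights
print(vote_label_map(pix, lab, w, num_labels=4, H=2, W=2))
```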
Install requirements
```bash
conda create -n opengsslam python==3.9
conda activate opengsslam
conda install pytorch==2.0.0 torchvision==0.15.0 torchaudio==2.0.0 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install -r requirements.txt
```
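Before installing the submodules, it can help to confirm that PyTorch sees the GPU. A generic sanity check (not part of the repo):

```python
import torch

# The pipeline assumes a CUDA-capable GPU (the paper's results were
# produced on an RTX 4090).
print(torch.__version__)          # expected: 2.0.0
print(torch.cuda.is_available())  # expected: True
print(torch.version.cuda)         # expected: 11.8
```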
Install submodules
```bash
conda activate opengsslam
pip install submodules/diff-gaussian-rasterization
pip install submodules/simple-knn
```
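A quick way to verify that both CUDA extensions built correctly is to import them. The module and symbol names below follow the standard 3D Gaussian Splatting submodules and are assumed, not confirmed, for this repo's forks:

```python
# Importing the compiled extensions; an ImportError here usually means
# the CUDA build failed or the CUDA/PyTorch versions are mismatched.
from diff_gaussian_rasterization import GaussianRasterizationSettings, GaussianRasterizer
from simple_knn._C import distCUDA2

print("submodules imported OK")
```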
1. Download our pre-constructed Semantic 3D Gaussian scenes for the Replica dataset from the following link: Drive
2. Run the interactive viewer on a downloaded scene:
```bash
python ./final_vis.py --scene_npz [download_path]/room1.npz
```
Here, users can click on any object in the scene to interact with it and use our Gaussian Voting method for real-time semantic rendering. Note that we use the pynput library to capture mouse clicks, which returns the click position on the entire screen. To map this position to the display window, we subtract an offset (x_off, y_off) representing the window's top-left corner on the screen. All tests were conducted on an Ubuntu system at 2K resolution.
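For reference, here is a minimal sketch of this click-to-window mapping with pynput, assuming hypothetical offsets and window size; it illustrates the described behavior rather than reproducing the viewer's code:

```python
from pynput import mouse

# Hypothetical values: the window's top-left corner on the screen and its
# size; in the viewer these depend on your display setup.
x_off, y_off = 100, 100
WIN_W, WIN_H = 1200, 680

def on_click(x, y, button, pressed):
    if not pressed:
        return
    # pynput reports screen coordinates; shift them into window space.
    wx, wy = x - x_off, y - y_off
    if 0 <= wx < WIN_W and 0 <= wy < WIN_H:
        print(f"clicked pixel ({wx}, {wy}) inside the render window")

listener = mouse.Listener(on_click=on_click)
listener.start()  # the listener runs in a background thread
listener.join()
```

Once the viewer window has focus, the following keys are available: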
- T: Toggle between color and label display modes.
- J: Toggle between showing all objects or a single object.
- K: Capture the current view.
- A: Translate the object along the x-axis by +0.01.
- S: Translate the object along the y-axis by +0.01.
- D: Translate the object along the z-axis by +0.01.
- Z: Translate the object along the x-axis by -0.01.
- X: Translate the object along the y-axis by -0.01.
- C: Translate the object along the z-axis by -0.01.
- F: Rotate the object around the x-axis by +1 degree.
- G: Rotate the object around the y-axis by +1 degree.
- H: Rotate the object around the z-axis by +1 degree.
- V: Rotate the object around the x-axis by -1 degree.
- B: Rotate the object around the y-axis by -1 degree.
- N: Rotate the object around the z-axis by -1 degree.
- O: Output the current camera view matrix.
- M: Switch to the next mapping camera view.
- L: Increase the scale of all Gaussians.
- P: Downsample Gaussians using a voxel grid (see the sketch after this list).
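As a rough illustration of the voxel-grid downsampling behind the P key (a minimal sketch over an assumed (N, 3) array of Gaussian centers; the viewer must also carry along the surviving Gaussians' other attributes):

```python
import numpy as np

def voxel_downsample(means, voxel_size=0.05):
    # Quantize each Gaussian center to an integer voxel index.
    idx = np.floor(means / voxel_size).astype(np.int64)
    # Keep one representative Gaussian per occupied voxel.
    _, keep = np.unique(idx, axis=0, return_index=True)
    return keep  # indices of the surviving Gaussians

means = np.random.rand(10_000, 3)  # stand-in for real Gaussian centers
keep = voxel_downsample(means)
print(f"kept {keep.size} of {means.shape[0]} Gaussians")
```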
Coming soon!
We sincerely thank the developers and contributors of the many open-source projects that our code is built upon.
If you find our paper and code useful, please cite us:
```bibtex
@article{yang2025opengs,
  title={OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding},
  author={Yang, Dianyi and Gao, Yu and Wang, Xihan and Yue, Yufeng and Yang, Yi and Fu, Mengyin},
  journal={arXiv preprint arXiv:2503.01646},
  year={2025}
}
```