Jing He1, Haodong Li12✱, Mingzhi Sheng1✱, Ying-Cong Chen13✉
1HKUST(GZ)
2UC San Diego
3HKUST
✱Both authors contributed equally.
✉Corresponding author.
We present Lotus-2, a two-stage deterministic framework for monocular geometric dense prediction. Our method leverages pre-trained generative model as a deterministic world prior to achieve new state-of-the-art accuracy while requiring remarkably minimal data (trained on only 0.66% of the samples used by MoGe-2). This figure demonstrates Lotus-2's robust zero-shot generalization with sharp geometric details, especially in challenging cases like oil paintings and transparent objects.
🚀🚀🚀 Please also check the Project Page and Github Repo our prior work: Lotus! 🚀🚀🚀
- 2025-12-01: Paper released!
- 2025-11-28: The inference code and HuggingFace demo (Depth & Normal) are available!
This installation was tested on: Ubuntu 20.04 LTS, Python 3.10, CUDA 12.3, NVIDIA A800-SXM4-80GB.
- Be sure you have a GPU with at least 40GB memory.
- Clone the repository (requires git):
git clone https://github.com/EnVision-Research/Lotus-2.git cd Lotus-2 - Install dependencies (requires conda):
conda create -n lotus2 python=3.10 -y conda activate lotus2 pip install -r requirements.txt - Be sure you have access to
black-forest-labs/FLUX.1-dev. - Login your huggingface account via (if you want to switch account, run
hf auth logoutat first):hf auth login
- For depth estimation, run:
python app.py depth - For normal estimation, run:
python app.py normal
- Place your images in a directory, for example, under
./assets/in-the-wild_example(where we have already prepared several examples). - Run the inference command:
sh infer.sh
- Note: The inference code will automatically download the required model weights. You also can download them manually using the HuggingFace CLI:
Use the following arguments to specify the paths:
hf download jingheya/Lotus-2 --local-dir <path/to/your/local/directory>--core_predictor_model_path,--lcm_model_path, and--detail_sharpener_model_path.
- Prepare benchmark datasets:
- For depth estimation, please download the Marigold evaluation datasets via:
cd datasets/eval/depth/ wget -r -np -nH --cut-dirs=4 -R "index.html*" -P . https://share.phys.ethz.ch/~pf/bingkedata/marigold/evaluation_dataset/ - For normal estimation, please (manually) download the DSINE evaluation datasets (
dsine_eval.zip) under:datasets/eval/normal/and unzip it.
- Run the evaluation command (modify the
TASK_NAMEineval.shto switch tasks):sh eval.sh
If you find our work useful in your research, please consider citing our paper:
@article{he2025lotus,
title={Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model},
author={He, Jing and Li, Haodong and Sheng, Mingzhi and Chen, Ying-Cong},
journal={arXiv preprint arXiv:2512.01030},
year={2025}
}