Yuhan Zhang*
·
Long Zhuo*
·
Ziyang Chu*
·
Tong Wu†
·
Zhibing Li
·
Liang Pan†
·
Dahua Lin
·
Ziwei Liu†
*Equal contribution †Corresponding authors
[Project page]
[ArXiv]
[Leaderboard]
[Dataset]
Overview of Hi3DEval, a unified framework for 3D generation evaluation with three key components: 1) Hierarchical evaluation protocols that jointly assess object-level and partlevel quality, with extended material evaluation via reflectance cues. 2) A large-scale benchmark featuring a diverse set of 3D generative models and extensive human-aligned annotations generated via a multi-agent, multi-modal LLMs pipeline. 3) A hybrid automated scoring system that integrates video-based and naive 3D-based representations to enhance evaluators’ perceptions of 3D structure.
- Our system support object-level evaluation, which is used for leaderboard. The evaluation protocol is consisted with five dimensions,
Geomotry Plausibility,Geometry Details,Texture Quality,Geometry-Texture Coherenceand3D-Prompt Alignment. The detailed setup and usage can be found inobj_scoring/readme.md.
- Since existing benchmark solely support aesthetic-level texture judgments, which is coarse, we propose protocols tailered to explicitly assess core physical properties, such as albedo, saturation, and metallicness. The detailed setup and usage can be found in
material_scoring/README.md.
If our work is useful for your research, please consider citing our paper:
@article{zhang2025hi3deval,
title={Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity},
author={Zhang, Yuhan and Zhuo, Long and Chu, Ziyang and Wu, Tong and Li, Zhibing and Pan, Liang and Lin, Dahua and Liu, Ziwei},
journal={arXiv preprint arXiv:2508.05609},
year={2025}
}


