Official implementation of the CVPR 2026 paper "SonoWorld: From One Image to a 3D Audio-Visual Scene."
Code & dataset coming soon!
If you find our work useful, please cite:
@article{jin2026sonoworld,
title={SonoWorld: From One Image to a 3D Audio-Visual Scene},
author={Jin, Derong and Chen, Xiyi and Lin, Ming C. and Gao, Ruohan},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2026}
}