This is the implementation of the paper: ACVNet: Attention Concatenation Volume for Accurate and Efficient Stereo Matching, CVPR 2022, Gangwei Xu, Junda Cheng, Peng Guo, Xin Yang [Arxiv]
An informative and concise cost volume representation is vital for stereo matching of high accuracy and efficiency. In this paper, we present a novel cost volume construction method which generates attention weights from correlation clues to suppress redundant information and enhance matching-related information in the concatenation volume. To generate reliable attention weights, we propose multi-level adaptive patch matching to improve the distinctiveness of the matching cost at different disparities even for textureless regions.
- Python 3.8
- Pytorch 1.10
conda create -n acvnet python=3.8
conda activate acvnet
conda install pytorch torchvision torchaudio cudatoolkit=11.1 -c pytorch -c nvidia
pip install opencv-python
pip install tensorboard
pip install matplotlib
pip install tqdm
Download Scene Flow Datasets, KITTI 2012, KITTI 2015
As an example, use the following command to train ACVNet on Scene Flow
python main.py
The pretrained model on Scene Flow Datasets is saved in ./checkpoints/model_sceneflow.ckpt
Method | D1-bg (All) | D1-fg (All) | D1-all (All) | Runtime (s) |
---|---|---|---|---|
ACVNet | 1.37 % | 3.07 % | 1.65 % | 0.20 |
LEAStereo | 1.40 % | 2.91 % | 1.65 % | 0.30 |
GwcNet | 1.74 % | 3.93 % | 2.11 % | 0.32 |
PSMNet | 1.86 % | 4.62 % | 2.32 % | 0.41 |
If you find this project helpful in your research, welcome to cite the paper.
@article{xu2022ACVNet,
title={ACVNet: Attention Concatenation Volume for Accurate and Efficient Stereo Matching},
author={Gangwei Xu, Junda Cheng, Peng Guo, Xin Yang},
journal={arXiv:2203.02146},
year={2022}
}
Thanks to Xiaoyang Guo for opening source of his excellent work GwcNet. Our work is inspired by this work and part of codes are migrated from GwcNet.