Cost Volume Aggregation in Stereo Matching Revisited: A Disparity Classification Perspective

Yun Wang,Longguang Wang,Kunhong Li,Yongjian Zhang,Dapeng Oliver Wu,Yulan Guo
DOI: https://doi.org/10.1109/tip.2024.3484251
IF: 10.6
2024-11-25
IEEE Transactions on Image Processing
Abstract:Cost aggregation plays a critical role in existing stereo matching methods. In this paper, we revisit cost aggregation in stereo matching from disparity classification and propose a generic yet efficient Disparity Context Aggregation (DCA) module to improve the performance of CNN-based methods. Our approach is based on an insight that a coarse disparity class prior is beneficial to disparity regression. To obtain such a prior, we first classify pixels in an image into several disparity classes and treat pixels within the same class as homogeneous regions. We then generate homogeneous region representations and incorporate these representations into the cost volume to suppress irrelevant information while enhancing the matching ability for cost aggregation. With the help of homogeneous region representations, efficient and informative cost aggregation can be achieved with only a shallow 3D CNN. Our DCA module is fully-differentiable and well-compatible with different network architectures, which can be seamlessly plugged into existing networks to improve performance with small additional overheads. It is demonstrated that our DCA module can effectively exploit disparity class priors to improve the performance of cost aggregation. Based on our DCA, we design a highly accurate network named DCANet, which achieves state-of-the-art performance on several benchmarks.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?