GPDF-Net: geometric prior-guided stereo matching with disparity fusion refinement

Qi Zhao,Congxuan Zhang,Zhibo Rao,Zhen Chen,Zige Wang,Ke Lu
DOI: https://doi.org/10.1007/s00371-024-03459-3
IF: 2.835
2024-06-06
The Visual Computer
Abstract:Stereo matching is a popular topic in the image processing and computer vision fields. Although deep learning-based stereo matching approaches have achieved remarkable performance with respect to both estimation accuracy and computation efficiency, textureless and occluded regions and regions with edge blurring remain significant challenges for most existing stereo matching methods. To address these issues, we propose a geometric prior-guided stereo matching method with disparity fusion refinement, named GPDF-Net, in this paper. First, we exploit a geometric prior guidance module in the feature extraction part to enable features to obtain the global information of cross-view interactions and pay attention to the structural information contained in the input image. Second, we construct cost volume with disparity fusion refinement that introduces disparity-related geometric features to the concatenation volume and suppresses its redundant information to provide a better similarity measure. Third, we explore a method for replacing 3D convolution, making the cost aggregation module lighter and more efficient. Finally, we compare the proposed method with several state-of-the-art approaches on the Scene Flow and KITTI test databases. The experimental results demonstrate that the proposed method achieves competitive performance with respect to both accuracy and robustness, and it produces better results than those of other methods when given blurred edges and textureless and occluded areas. (The code is available at https://github.com/PCwenyue.)
computer science, software engineering
What problem does this paper attempt to address?