A multidimensional fusion image stereo matching algorithm

Zhenhua Quan, Liang Luo, Bin Wu
DOI: https://doi.org/10.1049/ipr2.13072
IF: 2.3
2024-03-13
IET Image Processing
Abstract: Stereo matching algorithms have low matching accuracy in image regions with specular reflection. To address this, the paper proposes a multidimensional fusion stereo matching algorithm based on multiple attention mechanisms, named MFANet. The algorithm embeds a multispectral attention module into the residual feature extraction network, using two-dimensional discrete cosine transforms to extract frequency features. In the pyramid pooling module, a coordinate attention mechanism is introduced to capture relevant positional information. In the cost aggregation stage, MFANet incorporates a three-dimensional attention mechanism that focuses on the more important semantic information in high-level features. By combining detailed information from low-level features, semantic information from high-level features, and contextual information, the algorithm generates features that are more conducive to disparity prediction. MFANet is evaluated on three standard datasets (SceneFlow, KITTI2015, and KITTI2012). Experimental results on the KITTI2015 dataset show that MFANet is less affected by specular reflections than the baseline PSMNet algorithm. Comparative experiments on the specular regions of the KITTI2012 dataset show that the proposed algorithm predicts disparity more accurately in specular pathological regions. Overall, the results demonstrate robustness against specular reflection interference, accurate disparity prediction in specular reflection pathological regions, and promising application prospects.
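
The abstract does not give the details of the multispectral attention module, so the following is only a minimal sketch of the general idea: channel attention whose per-channel descriptors come from projections onto two-dimensional discrete cosine transform (DCT) bases instead of plain average pooling. The function names, the channel grouping, the chosen frequencies, and the two-layer gating weights `w1` and `w2` are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def dct2_basis(height, width, u, v):
    """Unnormalized 2D DCT-II basis of frequency (u, v) on a height x width grid."""
    ys = np.arange(height)
    xs = np.arange(width)
    basis_y = np.cos(np.pi * u * (ys + 0.5) / height)
    basis_x = np.cos(np.pi * v * (xs + 0.5) / width)
    return np.outer(basis_y, basis_x)


def multispectral_channel_descriptor(features, freqs):
    """Project each channel group of a (C, H, W) tensor onto one DCT basis.

    Channels are split into len(freqs) groups (an assumed design choice);
    group i is reduced with the basis of frequency freqs[i].
    Returns one scalar per channel, shape (C,).
    """
    channels, height, width = features.shape
    groups = np.array_split(np.arange(channels), len(freqs))
    descriptor = np.empty(channels)
    for idx, (u, v) in zip(groups, freqs):
        basis = dct2_basis(height, width, u, v)
        descriptor[idx] = (features[idx] * basis).sum(axis=(1, 2))
    return descriptor


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def multispectral_attention(features, freqs, w1, w2):
    """Rescale channels with weights derived from DCT frequency descriptors.

    w1 (hidden x C) and w2 (C x hidden) stand in for learned parameters.
    """
    descriptor = multispectral_channel_descriptor(features, freqs)
    hidden = np.maximum(w1 @ descriptor, 0.0)   # ReLU bottleneck
    weights = sigmoid(w2 @ hidden)              # per-channel gate in (0, 1)
    return features * weights[:, None, None]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.standard_normal((8, 4, 6))      # toy feature map (C, H, W)
    freqs = [(0, 0), (0, 1), (1, 0), (1, 1)]    # assumed frequency selection
    w1 = rng.standard_normal((2, 8))
    w2 = rng.standard_normal((8, 2))
    out = multispectral_attention(feats, freqs, w1, w2)
    print(out.shape)
```

With the single frequency (0, 0) the basis is constant, so the descriptor reduces to a sum over spatial positions, which is global average pooling up to a scale factor. Adding further frequencies lets the gate respond to spatial variation that average pooling discards.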
computer science, artificial intelligence; engineering, electrical & electronic; imaging science & photographic technology