A stereo matching network with a cascade spatial pyramid pooling (CSPP) substructure

Ting Tang, Zongqing Lu, Qingmin Liao
DOI: https://doi.org/10.1117/12.2539613
2019-01-01
Abstract: We propose a novel end-to-end supervised convolutional neural network (CNN) that computes disparity from a pair of stereo images. To address the difficulty of computing high-quality disparity in ill-posed areas, our cascade spatial pyramid pooling (CSPP) substructure gathers global context by aggregating context information at different positions and different feature block scales, from coarse to fine. We also introduce a warp layer: the right feature map is warped with the previously predicted disparity and then compared with the left feature map to form a cost volume. We learn the disparity from the cost volume using feature information from different levels. We evaluate our method on three stereo datasets, and the results show that it has advantages in textured areas, target edge areas, and efficiency. It also achieves a high ranking.
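The warp step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes single-channel feature maps stored as nested lists, the common rectified-stereo convention that a left-view pixel at column x matches the right-view pixel at column x - d, linear interpolation along x, and an absolute difference as the comparison. The abstract does not state how the two maps are compared or how the cost volume is laid out, and the function names `warp_right_to_left` and `residual_cost` are hypothetical.

```python
import math


def warp_right_to_left(right, disparity):
    """Sample the right feature map at column x - d for each left-view pixel.

    right, disparity: 2D lists of floats with the same shape (H x W).
    Linear interpolation is used along x; samples that fall outside the
    map contribute 0.0 (an assumption made for this sketch).
    """
    height, width = len(right), len(right[0])
    warped = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            src = x - disparity[y][x]      # matching column in the right view
            x0 = math.floor(src)           # left neighbour of the sample point
            frac = src - x0                # fractional offset in [0, 1)
            for xi, weight in ((x0, 1.0 - frac), (x0 + 1, frac)):
                if 0 <= xi < width and weight > 0.0:
                    warped[y][x] += weight * right[y][xi]
    return warped


def residual_cost(left, warped_right):
    """Per-pixel absolute difference between left and warped right features.

    This stands in for the comparison step; the paper's actual cost
    volume construction may differ.
    """
    return [[abs(l - r) for l, r in zip(l_row, r_row)]
            for l_row, r_row in zip(left, warped_right)]


# Toy example: the right row is the left row shifted by one pixel.
left = [[0.0, 0.0, 5.0, 9.0]]
right = [[0.0, 5.0, 9.0, 0.0]]
disparity = [[1.0, 1.0, 1.0, 1.0]]

warped = warp_right_to_left(right, disparity)
cost = residual_cost(left, warped)
```

With a correct disparity the warped right map lines up with the left map, so the residual is small. A refinement stage can then concentrate on the pixels where the residual stays large.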