End-to-end Learning of Self-Rectification and Self-Supervised Disparity Prediction for Stereo Vision

Xuchong Zhang,Yongli Zhao,Hang Wang,Han Zhai,Hongbin Sun,Nanning Zheng
DOI: https://doi.org/10.1016/j.neucom.2022.04.095
IF: 6
2022-01-01
Neurocomputing
Abstract:Stereo rectification and stereo matching are two critical components for the practical application of stereo vision systems. Previous studies treat them as two individual issues. For stereo rectification, var-ious traditional algorithms are proposed to estimate homography transformations, but the performance and the efficiency are unsatisfactory for real-time deployment. For stereo matching, disparity accuracy has been largely improved by learning based methods. However, the input data of all previous stereo net-works are assumed to be a pair of offline pre-rectified images, making them invalidate for accurate matching when the stereo vision system suffers from mechanical misalignment due to external collisions or temperature variations. In this paper, we optimize these two components jointly and propose an end -to-end learning framework to achieve online self-rectification and self-supervised disparity prediction simultaneously. The overall network contains two cascaded subnetworks which enable stereo rectifica-tion and stereo matching sequentially for a pair of unrectified images. The experimental results are eval-uated on both publicly available datasets and realistic scenarios. Evaluation results demonstrate that, the proposed network produces state-of-the-art results for self-rectification in terms of computation accu-racy and speed, and also produces competitive disparity results with previous self-supervised methods. Therefore, the proposed design provides a more practical and efficient solution for stereo vision systems deployed on mobile platforms.(c) 2022 Elsevier B.V. All rights reserved.
What problem does this paper attempt to address?