Efficient Backbone Architecture Search for Stereo Depth Estimation in Autonomous Driving

Xuchong Zhang,He Dai,Jianing Chen,Hongbin Sun
DOI: https://doi.org/10.1109/itsc55140.2022.9922562
2022-01-01
Abstract:Recent advances in AutoML have extended Neural Architecture Search (NAS) beyond image classification to optimize dense prediction tasks. However, the existing works are inappropriate to search efficient backbone for deep learning based stereo matching, because their search spaces are not custom-designed according to the inherent requirements of the pixel-wise depth prediction. This paper proposes a differentiable architecture search specific for efficient stereo network backbone. In particular, the proposed method jointly optimizes the micro-architecture and the macro-architecture to search distinct cell structures and adaptive low-level features for stereo network backbone. The target architecture can be found within 3 GPU days using gradient-based optimization. The evaluation results on stereo datasets demonstrate that, by simply replacing the hand-crafted feature extraction with the searched backbone in a vanilla framework, the proposed network obtains much better disparity accuracy than the designs using existing NAS methods, and even achieves comparable performance compared with the state-of-the-art stereo networks that integrate various elaborate modules. Hence, the proposed NAS method is an efficient way to automate the stereo network architecture engineering.
What problem does this paper attempt to address?