M2Depth: A Novel Self-Supervised Multi-Camera Depth Estimation with Multi-Level Supervision

Ruihang Li, Shanding Ye, Zhe Yin, Tao Li, Zehua Zhang, KaiKai Xiao, Zhijie Pan
DOI: https://doi.org/10.1109/icme57554.2024.10687803
2024-01-01
Abstract: Self-supervised depth estimation in multi-camera systems depends on making effective use of keypoints, which are valuable but sparse sources of self-supervisory information. To this end, we introduce a novel multi-camera self-supervised depth estimation framework with multi-level supervision, comprising both local explicit and global implicit supervision. Specifically, we construct local explicit supervision from temporally invariant keypoint positions and the distances between pairs of keypoints. In parallel, we introduce D-PoseNet to establish global implicit supervision: it uses a cross-modal global attention mechanism to integrate depth estimation results with inter-frame transformation matrices, so that the two can be optimized iteratively. We conducted experiments on DDAD and nuScenes, two challenging multi-camera depth estimation datasets for autonomous driving. The results indicate that our method achieves state-of-the-art performance on both datasets.
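The local explicit supervision described in the abstract relies on the fact that distances between pairs of static keypoints do not change under rigid camera motion. The following is a minimal sketch of that idea only, not the authors' loss: the function names `backproject` and `pairwise_distance_loss`, the pinhole intrinsics `fx, fy, cx, cy`, and the choice of a mean absolute difference are all assumptions made for illustration.

```python
import math
from itertools import combinations


def backproject(u, v, depth, fx, fy, cx, cy):
    """Lift pixel (u, v) with a predicted depth to a 3D point in the camera frame.

    Assumes a simple pinhole camera model (hypothetical helper, not from the paper).
    """
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, depth)


def pairwise_distance_loss(points_t, points_t1):
    """Mean absolute difference between pairwise keypoint distances in two frames.

    points_t and points_t1 are lists of matched 3D keypoints, one per frame.
    For static keypoints and correct depth, the loss is zero regardless of the
    rigid motion between frames.
    """
    if len(points_t) != len(points_t1):
        raise ValueError("keypoint lists must be matched one-to-one")
    pairs = list(combinations(range(len(points_t)), 2))
    if not pairs:
        return 0.0
    total = 0.0
    for i, j in pairs:
        d_t = math.dist(points_t[i], points_t[j])
        d_t1 = math.dist(points_t1[i], points_t1[j])
        total += abs(d_t - d_t1)
    return total / len(pairs)
```

A pure translation of the keypoints leaves the loss at zero, whereas a depth scale error between frames changes the pairwise distances and is penalized. This is why such a term can constrain depth without ground-truth labels.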