Abstract:To estimate depth maps from monocular videos in a self-supervised way, existing methods simultaneously predict the pose changes between adjacent frames and the depth maps of each frame, and then reconstruct the forward or backward frames using them, thereby casting depth estimation as a frame reconstruction problem. The corresponding reconstruction loss, which serves as a key supervision signal for training the whole network, can adversely affect the depth estimation accuracy if it is not properly established. In this paper, we propose a novel self-supervised monocular depth estimation method from videos via adaptive reconstruction constraints, i.e., designing the loss functions by establishing more accurate reconstruction constraints. Specifically, we first propose a pose-adaptive reconstruction loss to adaptively select the optimal pose parameterizations that yield the minimum reconstruction errors, reducing the impact of inaccurate posture on frame reconstruction. Then, we propose a region-sensitive reconstruction loss that fully utilizes the pretrained image reconstruction model to adaptively identify the poorly reconstructed regions and characterize the deviation of these regions on feature space. Finally, we additionally construct a multi-frame depth estimation network and design a reconstruction-guided bidirectional distillation loss to adaptively adjust the direction of distillation between networks of multi-frame and monocular depth estimation based on their current reconstruction quality, which encourages them to learn from each other and benefits the core task of monocular depth estimation. With our proposed losses, we achieve superior performance in comparison with state-of-the-art methods on benchmark datasets.

Self-supervised monocular depth estimation in dynamic scenes with moving instance loss

Monocular Depth Estimation Based on Unsupervised Learning

A Depth Estimation Framework Based on Unsupervised Learning and Cross-Modal Translation

Unsupervised Monocular Depth Perception: Focusing on Moving Objects

Self-Supervised Monocular Depth Estimation With Self-Perceptual Anomaly Handling

Enhancing Self-supervised Monocular Depth Estimation Via Incorporating Robust Constraints.

Monocular Depth Estimation Using Self-Supervised Learning with More Effective Geometric Constraints

Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

3D Object Aided Self-Supervised Monocular Depth Estimation

Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth

SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for Dynamic Scenes

Monocular Depth Estimation via Self-Supervised Self-Distillation

Self-supervised Monocular Depth Estimation with Multi-Scale Structure Similarity Loss

Self-supervised Monocular Depth Estimation with Self-Distillation and Dense Skip Connection

Self-Supervised Monocular Depth Estimation with Binary Mask and Lightweight Network

Self-Supervised Monocular Depth Estimation with Multi-constraints

D^3epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes

RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes

Self-supervised monocular depth estimation via joint attention and intelligent mask loss

Research on Self-Supervised Depth Estimation Algorithm of Driving Scene Based on Monocular Vision.

Self-Supervised Monocular Depth Estimation from Videos Via Adaptive Reconstruction Constraints