Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module

Yuanzhen Li,Fei Luo,Chunxia Xiao
DOI: https://doi.org/10.1007/s41095-022-0279-3
IF: 4.1268
2022-06-16
Computational Visual Media
Abstract:Self-supervised monocular depth estimation has been widely investigated and applied in previous works. However, existing methods suffer from texture-copy, depth drift, and incomplete structure. It is difficult for normal CNN networks to completely understand the relationship between the object and its surrounding environment. Moreover, it is hard to design the depth smoothness loss to balance depth smoothness and sharpness. To address these issues, we propose a coarse-to-fine method with a normalized convolutional block attention module (NCBAM). In the coarse estimation stage, we incorporate the NCBAM into depth and pose networks to overcome the texture-copy and depth drift problems. Then, we use a new network to refine the coarse depth guided by the color image and produce a structure-preserving depth result in the refinement stage. Our method can produce results competitive with state-of-the-art methods. Comprehensive experiments prove the effectiveness of our two-stage method using the NCBAM.
computer science, software engineering
What problem does this paper attempt to address?