Attention-guided Multi-view Stereo Network for Depth Estimation

Penghui Sun, Suping Wu, Kui Lin
DOI: https://doi.org/10.1109/hpcc-smartcity-dss50907.2020.00106
2020-01-01
Abstract: The purpose of Multi-View Stereo is to recover the 3D geometric model of a target from images taken from multiple viewpoints. Existing deep-learning-based approaches have several problems: the predicted depth maps lack detail, surface accuracy is low, and the reconstructed 3D point cloud models are incomplete. To overcome these problems, we propose the Attention-guided Multi-view Stereo Network for 3D Depth Estimation (AG-MVSNet). We combine camera geometry with a deep neural network and adopt a coarse-to-fine deep learning framework to recover the target 3D geometric model. High-quality detailed feature information has an important influence on multi-view 3D reconstruction, and reference images of natural environments contain the detailed feature information needed in the reconstruction process. We therefore use detailed feature information from different scales of the reference images to restore the details lost in the high-level features. Quantitative and qualitative experimental results show that the proposed algorithm produces more complete reconstructions than common multi-view 3D reconstruction algorithms.