Abstract: Recent learning-based methods demonstrate a strong ability to estimate depth for multi-view stereo (MVS) reconstruction. However, most of these methods extract features directly via regular or deformable convolutions, and few consider the alignment of receptive fields between views when constructing the cost volume. By analyzing the constraint and inference of previous MVS networks, we find shortcomings that still hinder performance. To address these issues, we propose an Epipolar-Guided Multi-View Stereo Network with Interval-Aware Label (EI-MVSNet), which combines an epipolar-guided volume construction module and an interval-aware depth estimation module in a unified architecture for MVS. The proposed EI-MVSNet has several merits. First, in the epipolar-guided volume construction module, we construct the cost volume with features from aligned receptive fields between different pairs of reference and source images via epipolar-guided convolutions, which take rotation and scale changes into account. Second, in the interval-aware depth estimation module, we attempt to supervise the cost volume directly and make depth estimation independent of extraneous values by perceiving the upper and lower boundaries, which achieves fine-grained predictions and enhances the reasoning ability of the network. Extensive experimental results on two standard benchmarks demonstrate that EI-MVSNet performs favorably against state-of-the-art MVS methods. Specifically, our EI-MVSNet ranks on both intermediate and advanced subsets of the Tanks and Temples benchmark, which verifies the high precision and strong robustness of our model.
