Abstract:Depth map is the basic requirement for all three-dimensional (3D) applications, but facing sensor noises, low frame-rate and low resolution in the procedure of data acquisition, especially in multiview cases. These problems bring obstacles to high quality 3D applications. Among the existing approaches, depth propagation is one of the promise approaches, and it can be utilized in temporal or spatial manner. However, propagation based algorithms process one aspect of the mentioned problems to pursuit local optimal solution. Actually, the process chain of depth map is from capture to application, and the optimization should be coupled instead of mutually independent. In this paper, we proposed a bundled-optimization scheme to process the thorough chain from capture to multiview dense depth map generation for the 3D applications. In this scheme, sensor noises in the captured low-resolution depth map are first detected and removed through a frequency-counting based non-linear filter. The filter refrains from the noise amplification in the procedure of depth map up-sampling. Low-pass blurring effect around high frequency areas is the by-product in up-sampling, and it is hard to detect in the depth map. We therefore propose a Blocklet based depth map optimization method for this blurring effect, and the accuracy of the high resolution depth map is then improved. Temporal depth propagation is then utilized on the optimized depth maps through the optical flow field rectified by temporal and spatial constrains. After that, a multi-set graph cut model is proposed to synthesize the multiview dense depth map. The experimental results indicate that our scheme can achieve at least 13.2575% PSNR gains when comparing to the benchmark depth map synthesis methods, and suggest the effectiveness of the proposed bundled-optimization method.

A Novel Sparse-to-dense Depth Map Generation Framework for Monocular Videos

A Depth Estimation Framework Based on Unsupervised Learning and Cross-Modal Translation

MFF-Net: Towards Efficient Monocular Depth Completion With Multi-Modal Feature Fusion

Dense Reconstruction from Monocular Slam with Fusion of Sparse Map-Points and Cnn-Inferred Depth.

Sparse Depth Densification for Monocular Depth Estimation

Real Time Complete Dense Depth Reconstruction for a Monocular Camera

Temporally Consistent Depth Map Estimation Based On 3d-Mrf

DRM-SLAM: Towards Dense Reconstruction of Monocular SLAM with Scene Depth Fusion.

Spatio-Temporal Depth Recovery of Dynamic Scenes with Multiple Handheld Cameras

High-Quality Depth Recovery Via Interactive Multi-view Stereo

Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation

MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model

Boosting Monocular Depth Estimation with Sparse Guided Points

Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth

CNN-MonoFusion: Online Monocular Dense Reconstruction Using Learned Depth from Single View

Monocular Depth Estimation using Diffusion Models

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Sparse-to-Dense Depth Estimation in Videos Via High-Dimensional Tensor Voting

Novel View Synthesis Using Feature-Preserving Depth Map Resampling

A Bundled-Optimization Model of Multiview Dense Depth Map Synthesis for Dynamic Scene Reconstruction