DSSMNeRF: Depth Self-supervised MVS NeRF

Yixuan Tong, Gengsheng Chen, Wei Xu
DOI: https://doi.org/10.1109/asicon58565.2023.10396495
2023-01-01
Abstract: Because they cannot capture enough scene-specific detail, existing view synthesis algorithms suffer largely from a trade-off between capturing high-frequency information and the need to match appropriate hyperparameters to different datasets. In this paper, we propose a novel Depth Self-supervised MVS NeRF (DSSMNeRF). As a generalizable model, DSSMNeRF learns geometry priors across different scenes and captures high-frequency appearance in a single scene through self-supervised training, using only 3 images and depth maps. DSSMNeRF extracts geometry-aware features through plane-swept cost volumes, and then synthesizes the diffuse color, blending color, and final color sequentially. For self-supervised training, we propose two new loss functions, a re-synthesis loss and a pseudo-supervision loss, to provide pixel-level self-supervision at any viewpoint. To filter out inappropriate supervision, an occlusion mask and a blending mask are applied before computing the pseudo-supervision loss. We validate DSSMNeRF on different scenes of the DTU dataset. Experimental results show that, in comparison with its peer works, DSSMNeRF reaches the best PSNR (19.19) in the three-input-view setting.
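As an illustration only, the idea of masking out unreliable pixels before computing a pixel-level loss, and the PSNR metric used for evaluation, can be sketched as follows. The abstract does not give the form of the pseudo-supervision loss or of the masks, so this sketch assumes a plain mean squared error, binary masks where 1 means "keep", and images flattened to lists of intensities in [0, 1]. The function names `masked_mse` and `psnr` are hypothetical and are not taken from the paper.

```python
import math


def masked_mse(pred, target, occlusion_mask, blending_mask):
    """Mean squared error over pixels kept by both masks.

    Assumption: both masks are binary (1 = keep, 0 = discard) and the loss
    is a plain MSE; the paper's actual loss may differ.
    """
    kept = [
        (p - t) ** 2
        for p, t, o, b in zip(pred, target, occlusion_mask, blending_mask)
        if o and b
    ]
    # If every pixel is filtered out, contribute no loss instead of dividing by zero.
    return sum(kept) / len(kept) if kept else 0.0


def psnr(pred, target, max_value=1.0):
    """Standard peak signal-to-noise ratio: 10 * log10(MAX^2 / MSE)."""
    mse = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


# Usage: the third and fourth pixels are discarded by one of the two masks,
# so only the first two pixels contribute to the loss.
loss = masked_mse(
    pred=[0.5, 0.5, 1.0, 0.0],
    target=[0.5, 0.0, 0.0, 0.0],
    occlusion_mask=[1, 1, 0, 1],
    blending_mask=[1, 1, 1, 0],
)
```

Requiring both masks to agree means a pixel rejected by either one cannot inject a misleading training signal, which matches the stated purpose of the masks at a high level.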