Single Image Based Three-Dimensional Scene Reconstruction Using Semantic and Geometric Priors

Gang-Joon Yoon,Jinjoo Song,Yu-Jin Hong,Sang Min Yoon
DOI: https://doi.org/10.1007/s11063-022-10780-2
IF: 2.565
2022-03-03
Neural Processing Letters
Abstract:Single image based three-dimensional (3D) scene reconstruction has become an important research topic for computer vision and computer graphics fields to provide machine vision systems with near human visual perception. Previous approaches for 3D scene reconstruction and depth estimation from single images required many factors including motion parallax, stereoscopic parallels, and various monocular depth cues adopted from known geometric priors. Deep learning based depth estimation techniques have advanced single image based depth estimation by aggregating various complexity information from RGB depth image datasets for training images to drive the process. This paper proposes an effective 3D scene estimation methodology by automatically extracting vanishing point and semantic information including 3D geometric characteristics without prior assumptions. The vanishing point is extracted from line segments and minimum spanning tree clustering to remove spurious noisy edges. Retracting geometric and semantic information from a given image is achieved by a generative adversarial network trained on the created training set. We verified the proposed approach’s efficiency and effectiveness experimentally with a large database created by directly recovering 3D scenes from from an input image.
computer science, artificial intelligence
What problem does this paper attempt to address?