Three Orthogonal Vanishing Points Estimation In Structured Scenes Using Convolutional Neural Networks

Yongjie Shi,Danfeng Zhang,Jingsi Wen,Xin Tong,He Zhao,Xianghua Ying,Hongbin Zha
DOI: https://doi.org/10.1109/icip.2019.8804405
2019-01-01
Abstract:Inferring 3D geometric cues is a crucial step, whereas vanishing point plays a very important role in image understanding from a single image of structured scenes. In this paper, we construct a 330-thousand-item image database of structured scenes labeled by vanishing points, focal length and camera orientation. We grab over 300 thousand Google Street View images which cover the downtown and neighboring areas of New York, Los Angeles, Chicago and etc. The prediction error is characterized by a loss function by imposing a regularization item derived from the geometric constraint of orthogonal vanishing points and focal length. Moreover, we collect about 30 thousand indoor images using a full 360-degree panorama camera taken by ourselves in room, office, library and etc. We also using Convolutional Neural Networks to transfer learning from street view images to indoor images. Extensive experiments demonstrate that our algorithm outperforms state-of-the-art non-learned approaches.
What problem does this paper attempt to address?