Indoor Scene Classification by Incorporating Predicted Depth Descriptor.

Yingbin Zheng,Jian Pu,Hong Wang,Hao Ye
DOI: https://doi.org/10.1007/978-3-319-77383-4_2
2017-01-01
Abstract:Depth cue is crucial for perception of spatial layout and understanding the cluttered indoor scenes. However, there is little study of leveraging depth information within the image scene classification systems, mainly because the lack of depth labeling in existing monocular image datasets. In this paper, we introduce a framework to overcome this limitation by incorporating the predicted depth descriptor of the monocular images for indoor scene classification. The depth prediction model is firstly learned from existing RGB-D dataset using the multiscale convolutional network. Given a monocular RGB image, a representation encoding the predicted depth cue is generated. This predicted depth descriptors can be further fused with features from color channels. Experiments are performed on two indoor scene classification benchmarks and the quantitative comparisons demonstrate the effectiveness of proposed scheme.
What problem does this paper attempt to address?