Good Practice on Deep Scene Classification: from Local Supervision to Knowledge Guided Disambiguation

Y. Qiao,Limin Wang,Sheng Guo,Zhe Wang,Weilin Huang,Yali Wang,Shenzhen
2017-01-01
Abstract:Recent years witness remarkable progresses of scene classification task, mainly with deep convolutional neural network trained on large scale datasets like Place [7]. This short paper summarize our recent works toward improving the performance of large scale scene classification with deep networks [4, 5, 1]. These works include: 1) encoding locally-supervised convolutional features for scene representation, 2) weakly training patch-level CNNs to extract local discriminative descriptors, and 3) exploiting knowledge from extra networks to release label ambiguity problem of scene categories. We will describe the key insights, approaches and results of our works with analysis on the connection and difference among them. Our methods achieves the second place at the Places2 challenge in ILSVRC 2015, and the first place at the LSUN challenge in CVPR 2016. One question we try to answer with this paper is how scene classification is related to and different from object classification. We hope these investigation and analysis will inspire novel ideas toward the challenging problem of complex scene understanding.
What problem does this paper attempt to address?