Weakly Supervised PatchNets : Learning Aggregated Patch Descriptors for Scene Recognition

Zhe Wang,Limin Wang,Yali Wang,Bowen Zhang,Yu Qiao,Charless Fowlkes
2017-01-01
Abstract:In this paper, we propose a hybrid representation, which leverages the great discriminative capacity of CNNs and the efficiency of descriptor encoding scheme scene recognition. We make three main contributions. First, we train an end-to-end PatchNet in a weakly supervised manner, in order to extract the discriminative deep descriptors of local patches. Second, we design a novel VSAD encoding approach. With the help of semantic predictions from PatchNet, it can effectively aggregate deep local-patch descriptors into a global image representation. Finally, we evaluate our approach on two standard scene recognition benchmarks to show the effectiveness, i.e., MIT Indoor67 (86.2%) and SUN397 (73.0%).
What problem does this paper attempt to address?