Perceptually Learning Multi-View Sparse Representation for Scene Categorization.

Weibin Yin,Dongsheng Xu,Zheng Wang,Zhijun Zhao,Chao Chen,Yiyang Yao
DOI: https://doi.org/10.1016/j.jvcir.2019.01.002
IF: 2.887
2019-01-01
Journal of Visual Communication and Image Representation
Abstract:Utilizing multi-channel visual features to characterize scenery images is standard for state-of-the-art scene recognition systems. However, how to encode human visual perception for scenery image modeling and how to optimally combine visual features from multiple views remains a tough challenge. In this paper, we propose a perceptual multi-view sparse learning (PMSL) framework to distinguish sceneries from different categories. Specifically, we first project regions from each scenery into the so-called perceptual space, which is established by combining human gaze behavior, color and texture. Afterward, a novel PMSL is developed which fuzes the above three visual cues into a sparse representation. PMSL can support absent channel visual features, which is frequently occurred in practical circumstances. Finally, the sparse representation from each scenery image is incorporated into an image kernel, which is further fed into a kernel SVM for scene categorization. Comprehensive experimental results on popular data sets have demonstrated the superiority of our method over well-known shallow/deep recognition models.
What problem does this paper attempt to address?