Image Classification Using Rbm To Encode Local Descriptors With Group Sparse Learning

Jinzhu Wang,Wenmin Wang,Ronggang Wang,Wen Gao
DOI: https://doi.org/10.1109/ICIP.2015.7350932
2015-01-01
ICIP
Abstract:This paper proposes to employ deep learningmodel to encode local descriptors for image classification. Previous works using deep architectures to obtain higher representations are often operated from pixel level, which lack the power to be generalized to large-size and complex images due to computational burdens and internal essence capture. Our method slips the leash of this limitation by starting from local descriptors to leverage more semantical inputs. We investigate to use two layers of Restricted Boltzmann Machines (RBMs) to encode different local descriptors with a novel group sparse learning (GSL) inspired by the recent success of sparse coding. Besides, unlike the most existing pure unsupervised feature coding strategies, we use another RBM corresponding to semantic labels to perform supervised fine-tuning which makes our model more suitable for classification task. Experimental results on Caltech-256 and Indoor-67 datasets demonstrate the effectiveness of our method.
What problem does this paper attempt to address?