Image Representation Based on Multiple Visual Codebooks

Yan SONG,Bing JIANG,Li-Rong DAI
DOI: https://doi.org/10.3969/j.issn.1003-6059.2013.10.002
2013-01-01
Abstract:The effectiveness of the image representation based on bag-of-visual words( BoW) model is majorly limited by the quantization error. To address this issue, an improved image representation based on multiple visual codebooks is proposed in this paper, which considers both visual codebook construction and feature coding. The proposed method specifically consists of 1 ) multiple visual codebooks construction, in which the compact and complementary visual codebooks are iteratively generated; 2 ) image representation, in which the visual words are firstly selected from each individual visual codebook, then the coding coefficients are determined by using the regularized linear regression method, and finally the image is represented by combining the spatial pyramid structure. The experimental results on several benchmark image classification datasets demonstrate the consistent and significant improvement of the proposed method.
What problem does this paper attempt to address?