Codebook Enhancement of Vlad Representation for Visual Recognition.

Zhe Wang,Yali Wang,Limin Wang,Yu Qiao
DOI: https://doi.org/10.1109/icassp.2016.7471878
2016-01-01
Abstract:Recent studies demonstrate the effectiveness of super vector representation in a number of visual recognition tasks. One popular approach along this line is the Vector of Locally Aggregated Descriptor (VLAD) where the super vector is encoded with a codebook generated by k-means. However, the effectiveness of the codebook is often limited, due to the poor clustering solution, the high dimensionality of visual descriptors and the global PCA for data preprocessing. To circumvent these problems, we propose three approaches for codebook enhancement, (i) partition of data, (ii) partition of feature, and (iii) local PCA. Moreover, all these approaches can be effectively integrated together to further boost the recognition performance. In our experiments, we evaluate our enhancement approaches on two challenging visual tasks, i.e., action recognition (HMDB51) and object recognition (PASCAL VOC2007). The results show that our approaches and the fusion versions significantly outperform the baselines.
What problem does this paper attempt to address?