Vocabulary Hierarchy Optimisation Based on Spatial Context and Category Information

Zhiguo Yang,Yuxin Peng,Jianguo Xiao
DOI: https://doi.org/10.1504/ijmis.2013.056470
2013-01-01
International Journal of Multimedia Intelligence and Security
Abstract:In this paper, we focus on the hierarchy and discriminating ability of visual vocabulary.We propose to use the category information of images and the spatial context of keypoints to select appropriate visual words from different hierarchical levels.Existing approaches, such as flat vocabulary and vocabulary tree, can change the hierarchy of all visual words at the same time, by setting different cluster numbers and tree height respectively.However, the most appropriate visual words may be at different hierarchical levels, and existing approaches could not adjust the hierarchy of different visual words separately.To address this problem, we propose an object function to describe the consistence of visual words, with category information of images and spatial context of keypoints, and then we adopt simulated annealing algorithm to search for a sub-optimal solution, which corresponds to a visual vocabulary selected from the vocabulary tree.Different from existing methods, the proposed approach can select the most appropriate visual words from different levels adaptively, which can improve the performances in image annotation and classification tasks.Experiments on widely-used 15-scenes dataset demonstrate the effectiveness of the proposed approach.
What problem does this paper attempt to address?