Ranking-Based Vocabulary Pruning In Bag-Of-Features For Image Retrieval

Fan Zhang,Yang Song,Weidong Cai,Alexander G. Hauptmann,Sidong Liu,Siqi Liu,David Dagan Feng,Mei Chen
DOI: https://doi.org/10.1007/978-3-319-14803-8_34
2015-01-01
Abstract:Content-based image retrieval (CBIR) has been applied to a variety of medical applications, e.g., pathology research and clinical decision support, and bag-of-features (BOF) model is one of the most widely used techniques. In this study, we address the problem of vocabulary pruning to reduce the influence from the redundant and noisy visual words. The conditional probability of each word upon the hidden topics extracted using probabilistic Latent Semantic Analysis (pLSA) is firstly calculated. A ranking method is then proposed to compute the significance of the words based on the relationship between the words and topics. Experiments on the publicly available Early Lung Cancer Action Program (ELCAP) database show that the method can reduce the number of words required while improving the retrieval performance. The proposed method is applicable to general image retrieval since it is independent of the problem domain.
What problem does this paper attempt to address?