Clustering web images by correlation mining of image-text
Fei Wu, HAN Ya-Hong, ZHUANG Yue-Ting, Jian Shao
DOI: https://doi.org/10.3724/SP.J.1001.2010.03704
2010-01-01
Ruan Jian Xue Bao/Journal of Software
Abstract: This paper proposes a framework for clustering Web image retrieval results. The framework mines the text surrounding each image for correlations between words and images, and uses those correlations to improve the clustering. Two kinds of correlation are considered: word-to-image and word-to-word. The tf-idf method, a standard text-processing technique, cannot measure word-to-image correlation directly, so the paper combines tf-idf with a property of words called visibility to infer it. Word-to-word correlations are weighted by a topic relevance function defined through the LDA model. Finally, complex graph clustering and spectral co-clustering algorithms are used to evaluate the effect of introducing visibility and topic relevance into image clustering. Encouraging experimental results are reported. © by Institute of Software, the Chinese Academy of Sciences. All rights reserved.
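The idea of combining tf-idf with visibility can be illustrated with a minimal sketch. The abstract does not give the combination rule, so the plain product used here is an assumption, as are the function name `tfidf_visibility_weights`, the toy surrounding-text tokens, and the visibility scores (taken to lie in [0, 1]); none of these come from the paper.

```python
import math
from collections import Counter


def tfidf_visibility_weights(docs, visibility, default_visibility=0.0):
    """Weight each word of each image's surrounding text.

    docs: list of token lists, one per image (hypothetical input format).
    visibility: dict mapping word -> score in [0, 1] (hypothetical values).
    Returns one dict per image mapping word -> tf * idf * visibility.
    The product rule is an assumption, not the paper's stated formula.
    """
    n_docs = len(docs)

    # Document frequency: number of images whose text contains the word.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    weights = []
    for tokens in docs:
        counts = Counter(tokens)
        total = len(tokens)
        row = {}
        for word, count in counts.items():
            tf = count / total
            idf = math.log(n_docs / df[word])
            row[word] = tf * idf * visibility.get(word, default_visibility)
        weights.append(row)
    return weights


# Toy surrounding text for three images (illustrative only).
docs = [
    ["tiger", "grass", "photo", "click"],
    ["tiger", "woods", "golf", "click"],
    ["beach", "sunset", "photo", "click"],
]

# Hypothetical visibility scores: how likely a word names something visible.
visibility = {
    "tiger": 0.9, "grass": 0.8, "beach": 0.9, "sunset": 0.8,
    "golf": 0.6, "woods": 0.5, "photo": 0.2, "click": 0.0,
}

weights = tfidf_visibility_weights(docs, visibility)
```

In this sketch a word such as "photo" has a nonzero tf-idf score but a low visibility score, so its word-to-image weight is pushed down relative to "grass". The resulting per-image weights could serve as edge weights of a word-image bipartite graph, the kind of input a spectral co-clustering step would take.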