Web image clustering method based on image and text relevant mining

Yueting Zhuang,Fei Wu,Yahong Han
2009-01-01
Abstract:The invention discloses a web image clustering method based on image and text relevant mining, which comprises the following steps of: (1) extracting images and associated texts thereof in Google image searching results according to the query; (2) extracting nouns in the associated texts to form a vocabulary list; (3) calculating the visibility of words in the vocabulary list; the visibility and a TF-IDF method are integrated for calculating the relative association between the words and the images; (4) calculating the theme degree of association between any two words in the vocabulary list; (5) a complex map is used for modeling the relative association; (6) a complex map clustering arithmetic is applied for clustering the images. The method combines the visibility of the words and the TF-IDF method to define the relative association between the words and the images and breakthroughs the restriction that the TF-IDF as a text processing text can not directly measure the relation between the words and the images; by modeling the relative association between the words and the images and between the words by the complex map, a web image clustering frame is provided so that the image searching results are classified according to the theme, thus be convenient for searching by users.
What problem does this paper attempt to address?