Web image interpretation: semi-supervised mining annotated words

Fei Wu, Dingyi Xia, Yueting Zhuang, Hanwang Zhang, Wenhao Liu
DOI: https://doi.org/10.1109/ICME.2009.5202791
2009-01-01
Abstract: An image is worth a thousand words. Automatic Web image annotation is a practical and effective aid to both Web image retrieval and image understanding. However, current annotation techniques struggle to produce natural-language interpretations of images, such as "pandas eat bamboo". In this paper, we propose an approach that interprets image semantics through semi-supervised mining of annotated words. The approach consists of three main parts: first, the visibility of the target image's annotated words is calculated by a semi-supervised learning approach from landmark words in WordNet; then, the annotated words are used as queries to retrieve matching Web pages; finally, the meaningful sentences in the matched Web pages are ranked by a semi-supervised learning approach, and the ranked sentences serve as the interpretation of the target image. Experiments conducted on real-world Web images demonstrate the effectiveness of the proposed approach.