Abstract:Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework where a query keyword is provided along with the uncaptioned image to improve both the effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. In this sense, our approach contains three steps: 1) the search process to discover visually and semantically similar search results, 2) the mining process to identify salient terms from textual descriptions of the search results, and 3) the annotation rejection process to filter out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged-one is to map the high-dimensional image visual features into hash codes, the other is to implement it as a distributed system, of which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and-conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to prove this.

Efficient Tag Mining Via Mixture Modeling for Real-Time Search-Based Image Annotation.

Annotating Images by Mining Image Search Results

Image annotation using search and mining technologies.

Topic models for image annotation and text illustration

Image Annotation by Large-Scale Content-Based Image Retrieval

Automatic Image Annotation Using Social Group Latent Topic Mining and Multi-group Information Fusion

A Feature-Word-topic Model for Image Annotation and Retrieval

Distance Metric Learning from Uncertain Side Information with Application to Automated Photo Tagging

Automatic annotation of weakly-tagged social images on flickr using latent topic discovery of multiple groups

Automatic Image Annotations by Mining Web Image Data

Automatic Image Annotation Based on Wordnet and Hierarchical Ensembles

Automatic Video Annotation Through Search and Mining

Semi-automatic Dynamic Auxiliary-Tag-aided Image Annotation

Bridging the Semantic Gap Between Image Contents and Tags

A Semantic Context Model For Automatic Image Annotation

Automatic Image Annotation Based on Topic-Based Smoothing

A Hybrid Probabilistic Model for Unified Collaborative and Content-Based Image Tagging

A Search-Based Web Image Annotation Method

AnnoSearch: Image Auto-Annotation by Search

Annotating personal albums via web mining.