Abstract:Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework where a query keyword is provided along with the uncaptioned image to improve both the effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. In this sense, our approach contains three steps: 1) the search process to discover visually and semantically similar search results, 2) the mining process to identify salient terms from textual descriptions of the search results, and 3) the annotation rejection process to filter out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged-one is to map the high-dimensional image visual features into hash codes, the other is to implement it as a distributed system, of which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and-conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to prove this.

Web image interpretation: semi-supervised mining annotated words

Image Interpretation: Mining the Visible and Syntactic Correlation of Annotated Words

Search-Based Automatic Web Image Annotation Using Latent Visual and Semantic Analysis

Annotating Images by Mining Image Search Results

Automatic Image Annotations by Mining Web Image Data

Image annotation using search and mining technologies.

Automatic semantic annotation of images based on Web data.

Automatic Image Annotation Based on Wordnet and Hierarchical Ensembles

Mining Weakly Labeled Web Facial Images for Search-Based Face Annotation

Improve Web Image Retrieval by Refining Image Annotations

AnnoSearch: Image Auto-Annotation by Search

A Search-Based Web Image Annotation Method

Image Annotation by Large-Scale Content-Based Image Retrieval

Automatic web image annotation via web-scale image semantic space learning

FANS: face annotation by searching large-scale web facial images.

FANS: Face Annotation by Searching Large-scale Web Facial Images.(2013). Research Collection School Of Information Systems

An Image Retrieval And Semi-Automatic Annotation Scheme For Large Image Databases On The Web

Adaptive Model for Integrating Different Types of Associated Texts for Automated Annotation of Web Images

A Novel Data-driven Image Annotation Method

Improving Web-Based Learning: Automatic Annotation of Multimedia Semantics and Cross-Media Indexing