Abstract: Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation: a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework in which a query keyword is provided along with the uncaptioned image to improve both effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. Our approach consists of three steps: 1) the search process, which discovers visually and semantically similar search results; 2) the mining process, which identifies salient terms from the textual descriptions of the search results; and 3) the annotation rejection process, which filters out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged: one is to map the high-dimensional image visual features into hash codes; the other is to implement the approach as a distributed system, in which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and-conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to support this.
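
The three steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how features are hashed or how terms are scored, so the sketch assumes sign-of-random-projection hash codes, Hamming-distance ranking, and a simple frequency threshold for rejection. All names (`make_projections`, `hash_code`, `annotate`, `min_support`) and the toy data are hypothetical.

```python
import random
from collections import Counter


def make_projections(dim, bits, seed=0):
    """Random Gaussian projection rows (assumed hashing scheme, not from the paper)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(bits)]


def hash_code(features, projections):
    """Pack the signs of the projections into one integer hash code."""
    code = 0
    for row in projections:
        dot = sum(w * x for w, x in zip(row, features))
        code = (code << 1) | (1 if dot >= 0 else 0)
    return code


def hamming(a, b):
    """Number of differing bits between two hash codes."""
    return bin(a ^ b).count("1")


def annotate(query_features, keyword, database, projections, top_k=3, min_support=2):
    # Step 1 (search): keep images whose text mentions the keyword,
    # then rank them by Hamming distance to the query's hash code.
    q = hash_code(query_features, projections)
    candidates = [item for item in database if keyword in item["text"].lower().split()]
    candidates.sort(key=lambda item: hamming(q, item["code"]))
    results = candidates[:top_k]

    # Step 2 (mining): count how many search results mention each term.
    counts = Counter()
    for item in results:
        counts.update(set(item["text"].lower().split()) - {keyword})

    # Step 3 (rejection): drop terms supported by too few results.
    return [term for term, n in counts.most_common() if n >= min_support]


# Toy data: three visually identical images and one with opposite features.
projections = make_projections(dim=4, bits=16)
v = [0.5, -1.2, 0.3, 2.0]
neg_v = [-x for x in v]
database = [
    {"text": "sunset beach ocean", "code": hash_code(v, projections)},
    {"text": "sunset ocean waves", "code": hash_code(v, projections)},
    {"text": "sunset ocean sky", "code": hash_code(v, projections)},
    {"text": "sunset city skyline", "code": hash_code(neg_v, projections)},
    {"text": "tiger grass", "code": hash_code(v, projections)},
]

tags = annotate(v, "sunset", database, projections)
print(tags)
```

In this toy run the keyword narrows the candidates first, which mirrors the divide-and-conquer role the abstract assigns to the query keyword; comparing compact hash codes rather than raw feature vectors is what keeps the search step cheap.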

ARISTA - Image Search to Annotation on Billions of Web Photos

Duplicate-Search-Based Image Annotation Using Web-Scale Data.

Knowing a tree from the forest: art image retrieval using a society of profiles.

Image Annotation by Large-Scale Content-Based Image Retrieval

Annotating Images by Mining Image Search Results

AnnoSearch: Image Auto-Annotation by Search

FANS: Face Annotation by Searching Large-scale Web Facial Images. (2013). Research Collection School Of Information Systems

Image annotation using search and mining technologies.

Automatic Image Annotations by Mining Web Image Data

A Novel Data-driven Image Annotation Method

Annotating personal albums via web mining.

Automatic semantic annotation of images based on Web data.

Bridging the Semantic Gap Between Image Contents and Tags

A Search-Based Web Image Annotation Method

An Image Retrieval And Semi-Automatic Annotation Scheme For Large Image Databases On The Web

Search-Based Automatic Web Image Annotation Using Latent Visual and Semantic Analysis

Evolution of a Web-Scale Near Duplicate Image Detection System

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets