Abstract:Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework where a query keyword is provided along with the uncaptioned image to improve both the effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. In this sense, our approach contains three steps: 1) the search process to discover visually and semantically similar search results, 2) the mining process to identify salient terms from textual descriptions of the search results, and 3) the annotation rejection process to filter out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged-one is to map the high-dimensional image visual features into hash codes, the other is to implement it as a distributed system, of which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and-conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to prove this.

Duplicate-Search-Based

Duplicate-Search-Based Image Annotation Using Web-Scale Data.

ARISTA - Image Search to Annotation on Billions of Web Photos

AnnoSearch: Image Auto-Annotation by Search

Annotating Images by Mining Image Search Results

Image annotation using search and mining technologies.

FANS: Face Annotation by Searching Large-scale Web Facial Images.(2013). Research Collection School Of Information Systems

A Search-Based Web Image Annotation Method

Image Annotation by Large-Scale Content-Based Image Retrieval

Evolution of a Web-Scale Near Duplicate Image Detection System

Search-Based Automatic Web Image Annotation Using Latent Visual and Semantic Analysis

FANS: face annotation by searching large-scale web facial images.

Large-Scale Duplicate Detection for Web Image Search

An Image Retrieval And Semi-Automatic Annotation Scheme For Large Image Databases On The Web

Learning to Name Faces

Automatic Image Annotations by Mining Web Image Data

Annotating personal albums via web mining.

A Novel Data-driven Image Annotation Method

Fast and accurate near-duplicate image search with affinity propagation on the ImageWeb

Distance Metric Learning from Uncertain Side Information with Application to Automated Photo Tagging