Abstract: Auto face annotation, which aims to detect human faces in a facial image and assign them the proper names, is a fundamental research problem that benefits many real-world applications. In this work, we address this problem by investigating a retrieval-based annotation scheme that mines massive web facial images freely available on the Internet. In particular, given a facial image, we first retrieve the top n similar instances from a large-scale web facial image database using content-based image retrieval techniques, and then use their labels for auto annotation. Such a scheme faces two major challenges: 1) how to retrieve similar facial images that truly match the query, and 2) how to exploit the noisy labels of the top similar facial images, which may be incorrect or incomplete due to the nature of web images. In this paper, we propose an effective Weak Label Regularized Local Coordinate Coding (WLRLCC) technique, which exploits the principle of local coordinate coding by learning sparse features, and employs graph-based weak label regularization to enhance the weak labels of the similar facial images. An efficient optimization algorithm is proposed to solve the WLRLCC problem. Moreover, an effective sparse reconstruction scheme is developed to perform the face annotation task. We conduct extensive empirical studies on several web facial image databases to evaluate the proposed WLRLCC algorithm from different aspects. The experimental results validate its efficacy. We share the two constructed databases, "WDB" (714,454 images of 6,025 people) and "ADB" (126,070 images of 1,200 people), with the public. To further improve efficiency and scalability, we also propose an offline approximation scheme (AWLRLCC), which generally maintains comparable results while significantly reducing the annotation time.
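
The retrieve-then-annotate pipeline described in the abstract can be illustrated with a minimal sketch. This is not the WLRLCC algorithm: the sparse coding and graph-based label regularization steps are not specified here in enough detail to reproduce, so plain distance-weighted voting over the top n neighbors stands in for them. The function names, the toy two-dimensional feature vectors, and the example names are all hypothetical.

```python
import math
from collections import defaultdict


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def annotate_by_retrieval(query, database, n=3):
    """Rank candidate names for a query face (illustrative baseline only).

    database: list of (feature_vector, name) pairs whose names may be noisy.
    Returns (name, score) pairs sorted by descending score.
    """
    # Step 1: retrieve the top n most similar instances (brute-force search;
    # a real system would use an index built for large-scale retrieval).
    ranked = sorted(database, key=lambda item: euclidean(query, item[0]))
    top = ranked[:n]

    # Step 2: let each retrieved instance vote for its (weak) label,
    # weighting closer neighbors more heavily. This voting rule is an
    # assumption of this sketch, standing in for WLRLCC.
    votes = defaultdict(float)
    for features, name in top:
        votes[name] += 1.0 / (1.0 + euclidean(query, features))

    return sorted(votes.items(), key=lambda kv: kv[1], reverse=True)


# Toy database with hypothetical names and 2-D features.
database = [
    ([0.0, 0.0], "alice"),
    ([0.1, 0.0], "alice"),
    ([1.0, 1.0], "bob"),
    ([0.0, 0.2], "bob"),  # imagine this weak label is a noisy one
]

result = annotate_by_retrieval([0.0, 0.1], database, n=3)
best_name = result[0][0]
```

In this toy run the query has two close neighbors labeled "alice" and one labeled "bob", so the weighted vote ranks "alice" first. Simple voting of this kind is sensitive to exactly the label noise that the abstract identifies as the second challenge, which is the motivation for refining the weak labels before annotation.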

Web-Scale Image Annotation

Distance Metric Learning from Uncertain Side Information with Application to Automated Photo Tagging

Retrieval-Based Face Annotation by Weak Label Regularized Local Coordinate Coding

Tag-LDA for scalable real-time tag recommendation

Bridging the Semantic Gap Between Image Contents and Tags

Large scale microblog mining using distributed MB-LDA

Web and personal image annotation by mining label correlation with relaxed visual graph embedding

Automatic web image annotation via web-scale image semantic space learning

Image Annotation by Large-Scale Content-Based Image Retrieval

Duplicate-Search-Based Image Annotation Using Web-Scale Data

Supervised LDA for Image Annotation

Search-Based Automatic Web Image Annotation Using Latent Visual and Semantic Analysis

Efficient large-scale image annotation by probabilistic collaborative multi-label propagation

What If We Recaption Billions of Web Images with LLaMA-3?

Automatic Image Annotations by Mining Web Image Data

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets

Semi-automatic Dynamic Auxiliary-Tag-aided Image Annotation

An Algorithm for the Automatic Annotation Refinement on Large-Scale Web Images

Automatic Data Augmentation from Massive Web Images for Deep Visual Recognition

LabelMe: Online Image Annotation and Applications

Large-scale Remote Sensing Image Target Recognition and Automatic Annotation