Abstract:Auto face annotation aims to detect human faces from a facial image and tag the faces with the corresponding human names. It is a fundamental research problem and plays a critical role for many real-world applications in computer vision and pattern recognition. Instead of adopting traditional “model-based face annotation” techniques, the “search-based face annotation” recently has been gaining increasing attentions for mining large amounts of weakly labeled facial images freely available on the Internet. Although several web facial image databases have been constructed by researchers in literature (e.g., LFW, Yahoo!News, FAN-large, and PubFig), they are not suitable for addressing the search-based face annotation problem. To facilitate the research of search-based face annotation, we build WLFDB — a large-scale Weakly Labeled Face Database, with three major characteristics: (i) Large-scale Weakly Labeled Facial Images: WLFDB consists of over 6, 000 subjects and more than 700, 000 web facial images, where the images are weakly labeled with names from the query text; (ii) Rich Data Types: We make WLFDB a comprehensive testbed with three types of data: “raw web facial images”, “aligned facial images”, and “facial feature representations”. Researchers can easily use WLFDB with any type of data with minimal efforts; (iii) Benchmark Protocol: We provide a standard benchmark protocol to evaluate the performance of different search-based face annotation techniques based on the same ground truth test set. In addition, three baseline algorithms are evaluated based on the same test sets according to the hit rate metric. In summary, WLFDB is a large-scale weakly labeled facial images database that attempts to model real-world web facial images. We hope it will not only facilitate the research of search-based face annotation, but also benefit other kinds of face related research, such as face detection, alignment, verification, and recognition, etc. WLFDB is freely available to public for non-commercial research purposes at http://wlfdb.stevenhoi.com/.

Finding Celebrities in Billions of Web Images

FANS: Face Annotation by Searching Large-scale Web Facial Images.(2013). Research Collection School Of Information Systems

FANS: face annotation by searching large-scale web facial images.

Learning to Name Faces

Celelabel: An Interactive System For Annotating Celebrities In Web Videos

Name-Face Association in Web Videos：A Large-Scale Dataset, Baselines, and Open Issues

Retrieval-Based Face Annotation by Weak Label Regularized Local Coordinate Coding

Mining Weakly Labeled Web Facial Images for Search-Based Face Annotation

Duplicate-Search-Based Image Annotation Using Web-Scale Data.

Image Annotation by Large-Scale Content-Based Image Retrieval

Automated Video Labelling: Identifying Faces by Corroborative Evidence

Improving Automatic Name-Face Association Using Celebrity Images on the Web

Wlfdb: Weakly labeled face databases

ARISTA - Image Search to Annotation on Billions of Web Photos

CelebV-HQ: A Large-Scale Video Facial Attributes Dataset

Automatic Image Annotations by Mining Web Image Data

Annotating Images by Mining Image Search Results

Context-Oriented Name-Face Association in Web Videos.

Dataset Cleaning -- A Cross Validation Methodology for Large Facial Datasets using Face Recognition

Duplicate-Search-Based

15M Multimodal Facial Image-Text Dataset