Distance Metric Learning from Uncertain Side Information with Application to Automated Photo Tagging
Lei Wu,Steven C. H. Hoi,Rong Jin,Jianke Zhu,Nenghai Yu
DOI: https://doi.org/10.1145/1631272.1631293
2009-01-01
Abstract:Automated photo tagging is essential to make massive unlabeled photos searchable by text search engines. Conventional image annotation approaches, though working reasonably well on small testbeds, are either computationally expensive or inaccurate when dealing with large-scale photo tagging. Recently, with the popularity of networking websites, we observe a massive number of user-tagged referred to as social images, that are available on the web. Unlike traditional web images often contain tags and other user-generated content, which offer a new opportunity to resolve some long-standing challenges in multimedia. In this work, we aim to address the challenge of large-scale automated photo tagging by exploring the images. We present a retrieval based approach for automated photo tagging. To tag a test image, the proposed approach first retrieves k images that share the largest visual similarity with the test image. The tags of the test image are then derived based on the tagging of the similar images. Due to the well-known semantic gap issue, a regular Euclidean distance-based retrieval method often fails to find semantically relevant images. To address the challenge of semantic gap, we propose a novel probabilistic distance metric learning scheme that (1) automatically derives constraints from the uncertain side information, and (2) efficiently learns a distance metric from the derived constraints. We apply the proposed technique to automated photo tagging tasks based on a image testbed with over 200,000 images crawled from Flickr. Encouraging results show that the proposed technique is effective and promising for automated photo tagging.