Image Tag Refinement with View-Dependent Concept Representations

Jianlong Fu,Jinqiao Wang,Yong Rui,Xin-Jing Wang,Tao Mei,Hanqing Lu
DOI: https://doi.org/10.1109/tcsvt.2014.2380211
IF: 5.859
2015-01-01
IEEE Transactions on Circuits and Systems for Video Technology
Abstract:Image tag refinement is the task of refining initial tags of an image such that the refined tags can better reflect the content of the image and, therefore, can help users better access that image. The quality of tag refinement depends on the quality of concept representations that build a mapping from concepts to visual images. While good progress was made in the past decade on tag refinement, the previous approaches only achieved a limited success due to their limited concept representations. In this paper, we show that the visual appearances of a concept consist of both a generic view and a specific view, and therefore we can comprehensively represent a concept by two components. To ensure a clean concept representation, this representation is learned on clean click-through data, where noises are greatly reduced. In the framework, a coarse-to-fine image tag refinement is proposed, which: 1) first generates an efficient star graph to find candidate tags but missing in the initial tag list of an input image and 2) guided by this view-dependent concept representation, formulates a probabilistic objective function to eliminate irrelevant tags. Extensive experiments on two widely used standard data sets (MIRFlickr-25K and NUS-WIDE-270K) demonstrate the effectiveness of our approach.
What problem does this paper attempt to address?