Abstract:Automatic image annotation concerns a process of automatically labeling image contents with a pre-defined set of keywords, which are regarded as descriptors of image high-level semantics, so as to enable semantic image retrieval via keywords. A serious problem in this task is the unsatisfactory annotation performance due to the semantic gap between the visual content and keywords. Targeting at this problem, we present a new approach that tries to incorporate lexical semantics into the image annotation process. In the phase of training, given a training set of images labeled with keywords, a basic visual vocabulary consisting of visual terms, extracted from the image to represent its content, and the associated keywords is generated at first, using K-means clustering combined with semantic constraints obtained from WordNet, then the statistical correlation between visual terms and keywords is modeled by a two-level hierarchical ensemble model composed of probabilistic SVM classifiers and a co-occurrence language model. In the phase of annotation, given an unlabeled image, the most likely associated keywords are predicted by the posterior probability of each keyword given each visual term at the first-level classifier ensemble, then the second-level language model is used to refine the annotation quality by word co-occurrence statistics derived from the annotated keywords in the training set of images. We carried out experiments on a medium-sized image collection from Corel Stock Photo CDs. The experimental results demonstrated that the annotation performance of this method outperforms some traditional annotation methods by about 7% in average precision, showing the feasibility and effectiveness of the proposed approach.

Statistical Modeling and Conceptualization of Natural Images

Automatic image annotation by using concept-sensitive salient objects for image content representation.

Model Semantic Relations with Extended Attributes

A Semantic Context Model For Automatic Image Annotation

Towards Multi-Semantic Image Annotation with Graph Regularized Exclusive Group Lasso

Salient Objects: Semantic Building Blocks For Image Concept Interpretation

A Probabilistic Semantic Model for Image Annotation and Multi-Modal Image Retrieval

A stratification-based approach to accurate and fast image annotation

Automatic Image Annotation Based on Wordnet and Hierarchical Ensembles

Modeling Image Data for Effective Indexing and Retrieval in Large General Image Databases.

Multimodal Salient Objects: General Building Blocks Of Semantic Video Concepts

Automatic Image Annotation Based-On Model Space

Automatic image annotation via local multi-label classification

Automatic Model-Based Semantic Object Extraction Algorithm.

Image annotation using the summation of negative probability based on SVM

Semi-supervised topic modeling for image annotation.

Discovering Visual Concept Structure with Sparse and Incomplete Tags

Modeling semantic aspects for cross-media image indexing

Semantic Video Classification And Feature Subset Selection Under Context And Concept Uncertainty

Automatic image annotation based on salient regions

ImageSaker : A Semantic-based Image Retrieval System Refining with Concept Model