Abstract:Bag-of-words (BOW), which represents an image by the histogram of local patches on the basis of a visual vocabulary, has attracted intensive attention in visual categorization due to its good performance and flexibility. Conventional BOW neglects the contextual relations between local patches due to its Naïve Bayesian assumption. However, it is well known that contextual relations play an important role for human beings to recognize visual categories from their local appearance. This paper proposes a novel contextual bag-of-words (CBOW) representation to model two kinds of typical contextual relations between local patches, i.e., a semantic conceptual relation and a spatial neighboring relation. To model the semantic conceptual relation, visual words are grouped on multiple semantic levels according to the similarity of class distribution induced by them, accordingly local patches are encoded and images are represented. To explore the spatial neighboring relation, an automatic term extraction technique is adopted to measure the confidence that neighboring visual words are relevant. Word groups with high relevance are used and their statistics are incorporated into the BOW representation. Classification is taken using the support vector machine with an efficient kernel to incorporate the relational information. The proposed approach is extensively evaluated on two kinds of visual categorization tasks, i.e., video event and scene categorization. Experimental results demonstrate the importance of contextual relations of local patches and the CBOW shows superior performance to conventional BOW.

Visual Attention Based Bag-of-words Model for Image Classification

Object Recognition Based on the Region of Interest and Optimal Bag of Words Model.

Bag-of-Visual-Words Model Based on Classified Vector Quantization and Its Application in Image Classification

Discriminative Image Representation for Classification

Modified Bag of Visual Words Model for Image Classification

The Extended Bag of Words Model for Visual Recognition and Categorization

An image classification method based on bag of words model

Spatial Encoding of Visual Words for Image Classification.

An Approach to Improving Image Classification Using Visual Attention Weight Order

Nearest-Neighbors Based Weighted Method for the BOVW Applied to Image Classification

Image Classification Based on Modified BOW Model

Image classification by visual bag-of-words refinement and reduction

Visual saliency coding for image categorization

Visual language modeling for image classification.

Contextual Bag-of-Words for Visual Categorization

Visual Words Ambiguity Analysis in BOW Model

Bag-of-words image representation based on classified vector quantization

Visual Attention Based Image Classification

Object Classification of Aerial Images with Bag-of-Visual Words

Visual word coding based on difference maximization.

How to use Bag-of-Words model better for image classification