Abstract: The bag-of-words (BOW) model is used in many state-of-the-art image classification methods and is especially suitable for multi-class classification. Many kinds of local features and classifiers can be used with the BOW model. However, it is unclear which kind of local feature is both the most distinctive and the most robust, and which classifier yields the best classification performance. In this paper, we discuss the implementation choices in the BOW model and evaluate the influence of local features and classifiers on object and texture recognition within the BOW framework. To evaluate these choices, we use two popular datasets: the Xerox7 dataset (object categories) and the UIUCTex dataset (textures). Extensive experiments compare the performance of different detectors, descriptors and classifiers in terms of classification accuracy on both datasets. We find that a combined detector, which pairs the MSER detector with the Hessian-Laplacian detector, is effective at finding discriminative regions. We also find that the SIFT descriptor performs better than the other descriptors for image classification, and that the SVM classifier with the EMD kernel is superior to the other classifiers. Furthermore, we propose an EMD spatial kernel that encodes the spatial information of local features. The EMD spatial kernel is evaluated on the Xerox7 dataset, the 4-class VOC2006 dataset and the 4-class Caltech101 dataset. The experimental results show that the proposed kernel outperforms the EMD kernel, which does not take spatial information into account, in image classification.
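The quantization step at the core of the BOW model can be illustrated with a minimal sketch. It assumes a codebook of visual words is already available and assigns each local descriptor to its nearest word by Euclidean distance. The function name `bow_histogram`, the toy 2-D "descriptors" and the hard-assignment rule are illustrative assumptions, not the detectors, descriptors or codebook construction evaluated in the paper.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def bow_histogram(descriptors, codebook):
    """Return an L1-normalized histogram of visual-word occurrences.

    Hypothetical helper: each descriptor votes for its nearest codeword
    (hard assignment); this is a common BOW choice, assumed here.
    """
    counts = [0] * len(codebook)
    for d in descriptors:
        # Index of the nearest visual word for this descriptor.
        nearest = min(range(len(codebook)), key=lambda k: dist(d, codebook[k]))
        counts[nearest] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * len(codebook)
    return [c / total for c in counts]


# Toy data: three 2-D "visual words" and four 2-D "descriptors".
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
descriptors = [(0.1, -0.1), (0.9, 1.2), (1.1, 0.8), (4.7, 5.2)]

hist = bow_histogram(descriptors, codebook)
print(hist)  # [0.25, 0.5, 0.25]
```

In a real pipeline the descriptors would be high-dimensional (for example 128-D SIFT vectors) and the resulting histograms, or signatures, would be passed to a classifier such as an SVM.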

The Comparison of Classifiers for Object Categorization Based on Bag-of-Word Technology

Object Recognition Based on the Region of Interest and Optimal Bag of Words Model

A PCA Based Automatic Image Categorization Approach Using Dominant Color Features

Evaluation of Local Features and Classifiers in BOW Model for Image Classification

Object categorization based on a supervised mean shift algorithm

An image classification method based on bag of words model

Towards Optimal Bag-of-features for Object Categorization and Semantic Video Retrieval

Image classification using adapted codebook

Create Efficient Visual Codebook Based on Weighted mRMR for Object Categorization

The Extended Bag of Words Model for Visual Recognition and Categorization

Understanding bag-of-words model: a statistical framework

Soft Measure of Visual Token Occurrences for Object Categorization

Object Categorization in Sub-Semantic Space

Optimal operations for visual categorization

Object Categorization Using Hierarchical Wavelet Packet Texture Descriptors

Object Recognition via Adaptive Multi-level Feature Integration

Experimental Comparisons of Multi-class Classifiers

Object Classification of Remote Sensing Images Based on BOV

Contextual Bag-of-Words for Visual Categorization

Kernelized Pyramid Nearest-Neighbor Search for Object Categorization

From Bag Of Categories To Tree Of Object Recognition