Abstract: In recent years, the bag of visual words (BoV) model has become very popular in computer vision and, in particular, has been widely used for image classification. The earliest BoV-based methods use a clustering method to generate a single visual dictionary. This approach does not consider the differences between the categories of the image dataset, which leads to poor classification performance. A later, improved method builds a visual dictionary for each category in the image dataset and then concatenates these dictionaries to produce the final visual dictionary. However, when the dataset contains many categories, this method is time-consuming. In addition, building a visual dictionary for each category makes the visual words too dense, so they lose their ability to discriminate and express information, which degrades classification performance. In this paper, we propose a new image classification method based on multiple visual dictionaries. This method addresses the above problems effectively and improves classification performance significantly. To build the visual dictionary, the proposed method first randomly divides the image dataset, which contains many categories, into N items. It then clusters the local features of the images in each item separately, generating one visual dictionary per item. Finally, it concatenates these visual dictionaries to produce the final visual dictionary of the entire image dataset. We conducted experiments on the Caltech-101 and Scene-67 image libraries and compared our method with traditional methods. The experimental results show that the proposed method can match the best results published in the previous literature in terms of classification accuracy.
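The dictionary construction described in the abstract can be sketched roughly as follows. This is a minimal illustration under several assumptions that the abstract does not confirm: the random split is made at the image level, the clustering method is plain k-means, and images are encoded by hard-assignment histograms. The function names (`kmeans`, `build_multi_dictionary`, `bov_histogram`) and the parameters `n_items` and `words_per_item` are hypothetical, and random vectors stand in for real local descriptors.

```python
import numpy as np


def kmeans(features, k, n_iter=20, rng=None):
    """Plain Lloyd's k-means (an assumed clustering choice).

    Returns a (k, d) array of cluster centres, i.e. visual words.
    """
    rng = np.random.default_rng() if rng is None else rng
    # Initialise the centres with k distinct feature vectors.
    centres = features[rng.choice(len(features), size=k, replace=False)].copy()
    for _ in range(n_iter):
        # Squared distance from every feature to every centre.
        d2 = ((features[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = features[labels == j]
            if len(members) > 0:  # keep the old centre if a cluster empties
                centres[j] = members.mean(axis=0)
    return centres


def build_multi_dictionary(image_features, n_items, words_per_item, seed=0):
    """Build one dictionary per random item and concatenate them.

    image_features: list of (n_i, d) arrays, one array of local
    descriptors per image.
    """
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(image_features))
    # Assumption: the dataset is split randomly at the image level.
    groups = np.array_split(order, n_items)
    dictionaries = []
    for group in groups:
        pooled = np.vstack([image_features[i] for i in group])
        dictionaries.append(kmeans(pooled, words_per_item, rng=rng))
    # The final dictionary is the concatenation of the per-item dictionaries.
    return np.vstack(dictionaries)


def bov_histogram(features, dictionary):
    """Encode one image as a normalised histogram of nearest visual words."""
    d2 = ((features[:, None, :] - dictionary[None, :, :]) ** 2).sum(axis=2)
    counts = np.bincount(d2.argmin(axis=1), minlength=len(dictionary))
    return counts / counts.sum()


# Toy data: 12 "images", each with 50 random 8-dimensional descriptors.
demo_rng = np.random.default_rng(0)
images = [demo_rng.normal(size=(50, 8)) for _ in range(12)]
dictionary = build_multi_dictionary(images, n_items=3, words_per_item=4)
hist = bov_histogram(images[0], dictionary)
```

With `n_items=3` and `words_per_item=4`, the final dictionary holds 12 visual words, and each image is represented by a 12-bin histogram that could then be passed to any classifier. Because each clustering run sees only one item's features, the runs are independent and could be executed in parallel.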

Learning Discriminative Visual Dictionary for Natural Scene Categorization

CRF with Locality-Consistent Dictionary Learning for Semantic Segmentation

Explicit Feature Disentanglement for Visual Place Recognition Across Appearance Changes

Multi-level Discriminative Dictionary Learning with Application to Large Scale Image Classification

Learning Category-Specific Dictionary and Shared Dictionary for Fine-Grained Image Categorization

Scene Categorization by Deeply Learning Gaze Behavior in a Semisupervised Context

Learning Regions and Descriptors for Fine-grained Recognition

Refining local descriptors by embedding semantic information for visual categorization

Dictionary Learning Inspired Deep Network for Scene Recognition

Scene categorization via supervised subspace modeling and sparse representation

Decoupling Visual-Semantic Feature Learning for Robust Scene Text Recognition

Learning a Probabilistic Topology Discovering Model for Scene Categorization

Object Recognition via Adaptive Multi-level Feature Integration

Learning attribute-aware dictionary for image classification and search

Auto‐encoder‐based Shared Mid‐level Visual Dictionary Learning for Scene Classification Using Very High Resolution Remote Sensing Images

Weakly Supervised PatchNets: Learning Aggregated Patch Descriptors for Scene Recognition

Deep Discriminative Representation Learning with Attention Map for Scene Classification

Modeling spatial and semantic cues for large-scale near-duplicated image retrieval

An Image Classification Method Based on Multiple Visual Dictionaries

Webly-Supervised Fine-Grained Visual Categorization Via Deep Domain Adaptation

Deeply Encoding Stable Patterns From Contaminated Data for Scenery Image Recognition