Abstract:The idea of representing images using a bag of visual words is currently popular in object category recognition. Since this representation is typically constructed using unsupervised clustering, the resulting visual words may not capture the desired information. Recent work has explored the construction of discriminative visual codebooks that explicitly consider object category information. However, since the codebook generation process is still disconnected from that of classifier training, the set of resulting visual words, while individually discriminative, may not be those best suited for the classifier This paper proposes a novel optimization framework that unifies codebook generation with classifier training. In our approach, each image feature is encoded by a sequence of "visual bits" optimized for each category. An image, which can contain objects from multiple categories, is represented using aggregates of visual bits for each category. Classifiers associated with different categories determine how well a given image corresponds to each category. Based on the performance of these classifiers on the training data, we augment the visual words by generating additional bits. The classifiers are then updated to incorporate the new representation. These two phases are repeated until the desired performance is achieved. Experiments compare our approach to standard clustering-based methods and with state-of-the-art discriminative visual codebook generation. The significant improvements over previous techniques clearly demonstrate the value of unifying representation and classification into a single optimization framework.

An Integrative Codebook for Natural Scene Categorization

A Pca Based Automatic Image Categorization Approach Using Dominant Color Features

Non-Negative Sparse Semantic Coding for Text Categorization

SSC: Semantic Scan Context for Large-Scale Place Recognition

Creating Efficient Visual Codebook Ensembles for Object Categorization

Incorporating Spatial Correlogram into Bag-of-features Model for Scene Categorization

Improvements in image categorization using codebook ensembles

SENSC Algorithm for Object and Scene Categorization.

Learning Discriminative Visual Dictionary for Natural Scene Categorization

Unifying Discriminative Visual Codebook Generation with Classifier Training for Object Category Recognition

Scene Categorization by Deeply Learning Gaze Behavior in a Semisupervised Context

Adjacent Coding for Image Classification.

Visual place categorization

Multi-scale sparse representation based scene categorization

Learning a Probabilistic Topology Discovering Model for Scene Categorization.

Double-Keggin-type Anion-templated Synthesis of a 3D Porous Coordination Polymer of Eu(III) Ions and dpdo Ligands

A Two-Layer Local Constrained Sparse Coding Method for Fine-Grained Visual Categorization

Channel Interaction Networks for Fine-Grained Image Categorization

Building Descriptive and Discriminative Visual Codebook for Large-Scale Image Applications.

Codebook Enhancement of Vlad Representation for Visual Recognition.

Scene categorization via supervised subspace modeling and sparse representation