Abstract: State-of-the-art image classification approaches are mainly based on robust image representations, such as the bag-of-features (BoF) model or the convolutional neural network (CNN) architecture. In real applications, the orientation (left/right) of an image or an object may vary from sample to sample, whereas some handcrafted descriptors (e.g., SIFT) and network operations (e.g., convolution) are not reversal-invariant, which leads to unsatisfactory stability of the image features extracted from these models. To deal with this, a popular solution is to augment the dataset by adding a left-right reversed copy of each image. This strategy improves recognition accuracy to some extent, but at the price of almost doubled time and memory consumption in both the training and testing stages. In this paper, we present an alternative solution based on designing reversal-invariant representations of local patterns, so that we obtain an identical representation for an image and its left-right reversed copy. For the BoF model, we design a reversal-invariant version of the SIFT descriptor named Max-SIFT, a generalized RIDE algorithm which can be applied to a large family of local descriptors. For the CNN architecture, we present a simple idea for generating reversal-invariant deep features (RI-Deep) and, inspired by it, design reversal-invariant convolution (RI-Conv) layers to increase the CNN capacity without increasing the model complexity. Experiments reveal consistent accuracy gains on various image classification tasks, including scene understanding, fine-grained object recognition, and large-scale visual recognition.
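The central requirement stated in the abstract, an identical representation for an image and its left-right reversed copy, can be illustrated with a minimal Python sketch. This is only a toy under stated assumptions, not the paper's Max-SIFT, RIDE, RI-Deep, or RI-Conv definitions: `flip_lr`, `toy_descriptor`, and `reversal_invariant_descriptor` are hypothetical names, the descriptor is a made-up orientation-sensitive feature, and the canonical form is chosen here by simply keeping the lexicographically larger of the two candidate descriptors.

```python
def flip_lr(image):
    """Return the left-right reversed copy of a 2D image (list of rows)."""
    return [row[::-1] for row in image]


def toy_descriptor(image):
    """Hypothetical orientation-sensitive feature (an assumption, not SIFT).

    Each row is summarized by a position-weighted sum, so mirroring the
    image generally changes the result.
    """
    return tuple(sum(j * v for j, v in enumerate(row)) for row in image)


def reversal_invariant_descriptor(image, extract=toy_descriptor):
    """Pick one canonical descriptor out of the image and its mirror.

    Python compares tuples lexicographically, so max() returns the same
    tuple regardless of which of the two orientations was given as input.
    """
    original = extract(image)
    mirrored = extract(flip_lr(image))
    return max(original, mirrored)


image = [
    [1, 2, 3],
    [4, 0, 6],
    [7, 8, 1],
]

# The raw toy descriptor changes under reversal ...
assert toy_descriptor(image) != toy_descriptor(flip_lr(image))
# ... while the canonicalized descriptor does not.
assert reversal_invariant_descriptor(image) == reversal_invariant_descriptor(flip_lr(image))
```

For clarity the sketch extracts the toy descriptor twice, once per orientation, so it says nothing about the time and memory savings discussed in the abstract; it only shows why selecting a canonical candidate with a symmetric rule yields the same output for both orientations.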

Image classification with Max-SIFT descriptors

Max-SIFT: Flipping invariant descriptors for Web logo search

KPB-SIFT

Towards Reversal-Invariant Image Representation

Face Recognition based on scale invariant feature transform and Spatial Pyramid Representation

Efficient Image Copy Detection Using Multiscale Fingerprints

Affine-invariant SIFT Descriptor with Global Context

Fast SIFT algorithm based on Sobel edge detector

A Comparative Study of SIFT and Its Variants

The Image Matching Method Based on the Improved SIFT Descriptor

MFD: Mutual feature description for image matching

S-SIFT: A Shorter SIFT without Least Discriminability Visual Orientation

Cgci-Sift: A More Efficient and Compact Representation of Local Descriptor

Improving Scale Invariant Feature Transform-Based Descriptors with Shape-Color Alliance Robust Feature

DCT Inspired Feature Transform for Image Retrieval and Reconstruction

Remote Sensing Image Matching Based on Adaptive Binning SIFT Descriptor

Scale and Color Invariant Image Feature Description

Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search

Modified Bag of Visual Words Model for Image Classification

Deep Sparse Informative Transfer SoftMax for Cross-Domain Image Classification

Cross-Indexing of Binary Sift Codes for Large-Scale Image Search