Abstract:This paper mainly focuses on how to effectively and efficiently measure visual similarity for local feature based representation. Among existing methods, metrics based on Bag of Visual Word (BoV) techniques are efficient and conceptually simple, at the expense of effectiveness. By contrast, kernel based metrics are more effective, but at the cost of greater computational complexity and increased storage requirements. We show that a unified visual matching framework can be developed to encompass both BoV and kernel based metrics, in which local kernel plays an important role between feature pairs or between features and their reconstruction. Generally, local kernels are defined using Euclidean distance or its derivatives, based either explicitly or implicitly on an assumption of Gaussian noise. However, local features such as SIFT and HoG often follow a heavy-tailed distribution which tends to undermine the motivation behind Euclidean metrics. Motivated by recent advances in feature coding techniques, a novel efficient local coding based matching kernel (LCMK) method is proposed. This exploits the manifold structures in Hilbert space derived from local kernels. The proposed method combines advantages of both BoV and kernel based metrics, and achieves a linear computational complexity. This enables efficient and scalable visual matching to be performed on large scale image sets. To evaluate the effectiveness of the proposed LCMK method, we conduct extensive experiments with widely used benchmark datasets, including 15-Scenes, Caltech101/256, PASCAL VOC 2007 and 2011 datasets. Experimental results confirm the effectiveness of the relatively efficient LCMK method.

Beyond the Euclidean Distance: Creating Effective Visual Codebooks Using the Histogram Intersection Kernel

Efficient and Effective Visual Codebook Generation Using Additive Kernels

Nonlinear Discrete Cross-Modal Hashing for Visual-Textual Data

Visual Recognition Using Density Adaptive Clustering

A Fast Dual Method For Hik Svm Learning

Unifying Discriminative Visual Codebook Generation with Classifier Training for Object Category Recognition

Efficient HIK SVM Learning for Image Classification.

Local coding based matching kernel method for image classification

Codebook Enhancement of Vlad Representation for Visual Recognition.

Metric Learning in Codebook Generation of Bag-of-Words for Person Re-identification

A Fast Algorithm For Creating A Compact And Discriminative Visual Codebook

Learning visual codebooks for image classification using spectral clustering

Visual place categorization

Improved k-means clustering method for codebook generation

Visual word coding based on difference maximization.

Improved K-means Algorithm Using Initialization Technique Based on Edge-Mean Grid for Image Vector Quantizer Design.

Building Descriptive and Discriminative Visual Codebook for Large-Scale Image Applications.

Bilevel Visual Words Coding for Image Classification

Fast Codebook Design Method for Image Vector Quantisation

Visual words assignment via information-theoretic manifold embedding.

Learning Compact Binary Codes Via Pairwise Correlation Reconstruction