Abstract:By characterizing each image set as a nonsingular covariance matrix on the symmetric positive definite (SPD) manifold, the approaches of visual content classification with image sets have made impressive progress. However, the key challenge of unhelpfully large intraclass variability and interclass similarity of representations remains open to date. Although, several recent studies have mitigated the two problems by jointly learning the embedding mapping and the similarity metric on the original SPD manifold, their inherent shallow and linear feature transformation mechanism are not powerful enough to capture useful geometric features, especially in complex scenarios. To this end, this article explores a novel approach, termed SPD manifold deep metric learning (SMDML), for image set classification. Specifically, SMDML first selects a prevailing SPD manifold neural network (SPDNet) as the backbone (encoder) to derive an SPD matrix nonlinear representation. To counteract the degradation of structural information during multistage feature embedding, we construct a Riemannian decoder at the end of the encoder, trained by a reconstruction error term (RT), to induce the generated low-dimensional feature manifold of the hidden layer to capture the pivotal information about the visual data describing the imaged scene. We demonstrate through theory and experiments that it is feasible to replace the Riemannian metric with Euclidean distance in RT. Then, the ReCov layer is introduced into the established Riemannian network to regularize the local statistical information within each input feature matrix, which enhances the effectiveness of the learning process. The theoretical analysis of the activation function used in the ReCov layer in terms of continuity and conditions for generating positive definite matrices is beneficial for network design. Inspired by the fact that the single cross-entropy loss used for training is unable to effectively parse the geometric distribution of the deep representations, we finally endow the suggested model with a novel metric learning regularization term. By explicitly incorporating the encoding and processing of the data variations into the network learning process, this term can not only derive a powerful Riemannian representation but also train an effective classifier. The experimental results show the superiority of the proposed approach on three typical visual classification tasks.

DeepKSPD: Learning Kernel-Matrix-Based SPD Representation for Fine-Grained Image Recognition

Learning a Robust Representation via a Deep Network on Symmetric Positive Definite Manifolds

A Riemannian Network for SPD Matrix Learning

More About Covariance Descriptors for Image Set Coding: Log-Euclidean Framework based Kernel Matrix Representation

Collaborative Representation for SPD Matrices with Application to Image-Set Classification

Log-Euclidean Kernels for Sparse Representation and Dictionary Learning

SPD Manifold Deep Metric Learning for Image Set Classification

Learning Explicit Deep Representations from Deep Kernel Networks

Revisiting Metric Learning for SPD Matrix Based Visual Representation.

Covariance Discriminative Learning: A Natural and Efficient Approach to Image Set Classification

Learning Discriminative Stein Kernel for SPD Matrices and Its Applications

Learning Partial Correlation based Deep Visual Representation for Image Classification

Deep Metric Learning on the SPD Manifold for Image Set Classification

Manifold Kernel Sparse Representation of Symmetric Positive-Definite Matrices and Its Applications

Supervised Kernel Descriptors for Visual Recognition

Kernel collaborative face recognition

Robust Dictionary Learning and Sparse Coding with Riemannian Geometry Preserving Method in Symmetric Matrices Inner Product Space.

Deep Covariance Descriptors for Facial Expression Recognition

Hierarchical Feature Concatenation-Based Kernel Sparse Representations for Image Categorization

Kernelized Subspace Pooling for Deep Local Descriptors.

Deep Spatial Pyramid: The Devil is Once Again in the Details.