Abstract: Recent research places increasing emphasis on analyzing multiple features to improve face recognition (FR) performance. One popular scheme is to extend the sparse representation based classification framework with various sparse constraints. Although these methods study multiple features jointly through the constraints, they still process each feature individually and therefore overlook the possible high-level relationships among different features. It is reasonable to assume that the low-level features of facial images, such as edge information and smoothed (low-frequency) images, can be fused into a more compact and more discriminative representation based on these latent high-level relationships. FR on the fused features is expected to perform better than FR on the original features, since the fused features have more favorable properties. Focusing on this, we propose two strategies that first fuse multiple features and then exploit the dictionary learning (DL) framework for better FR performance. The first strategy is a simple and efficient two-step model, which learns a fusion matrix from training face images to fuse multiple features and then learns class-specific dictionaries based on the fused features. The second is a more effective but more computationally expensive model that learns the fusion matrix and the class-specific dictionaries simultaneously within an iterative optimization procedure. In addition, the second model separates the shared common components from the class-specific dictionaries to enhance the discriminative power of the dictionaries.
The proposed strategies, which integrate a multi-feature fusion process with a dictionary learning framework for FR, achieve the following goals: (1) exploiting multiple features of face images for better FR performance; (2) learning a fusion matrix to merge the features into a more compact and more discriminative representation; (3) learning class-specific dictionaries that account for the common patterns for better classification performance. We perform a series of experiments on publicly available databases to evaluate our methods, and the experimental results demonstrate the effectiveness of the proposed models.
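
The two-step strategy described above (fuse the features first, then build class-specific dictionaries) can be illustrated with a minimal sketch. This is not the authors' formulation, which the abstract does not spell out. Here an uncentered SVD projection stands in for the learned fusion matrix, a per-class SVD basis stands in for each learned class-specific dictionary, and plain least squares replaces sparse coding. All function names, dimensions, and the synthetic two-feature data are hypothetical.

```python
import numpy as np

def learn_fusion_matrix(features, dim):
    """Stand-in fusion: top right-singular vectors of the stacked features.
    `features` is a list of (n_samples, d_k) arrays, one per feature type."""
    stacked = np.hstack(features)
    _, _, vt = np.linalg.svd(stacked, full_matrices=False)
    return vt[:dim].T  # shape (sum of d_k, dim)

def learn_class_dictionaries(fused, labels, n_atoms):
    """Stand-in dictionaries: an orthonormal basis per class from its fused samples."""
    dictionaries = {}
    for c in np.unique(labels):
        class_samples = fused[labels == c].T  # columns are samples
        u, _, _ = np.linalg.svd(class_samples, full_matrices=False)
        dictionaries[int(c)] = u[:, :n_atoms]
    return dictionaries

def classify(sample, dictionaries):
    """Assign the class whose dictionary reconstructs the sample with least residual."""
    residuals = {}
    for c, D in dictionaries.items():
        code, *_ = np.linalg.lstsq(D, sample, rcond=None)
        residuals[c] = np.linalg.norm(sample - D @ code)
    return min(residuals, key=residuals.get)

# Synthetic data (assumption): each class lies along its own direction in two
# hypothetical feature spaces, e.g. an "edge" feature and a "low-frequency" feature.
rng = np.random.default_rng(0)
dirs_a = np.eye(6)[:2]
dirs_b = np.eye(4)[:2]

def make_split(n_per_class):
    feats_a, feats_b, labels = [], [], []
    for c in (0, 1):
        scale = rng.uniform(1.0, 2.0, size=(n_per_class, 1))
        feats_a.append(scale * dirs_a[c] + 0.01 * rng.standard_normal((n_per_class, 6)))
        feats_b.append(scale * dirs_b[c] + 0.01 * rng.standard_normal((n_per_class, 4)))
        labels += [c] * n_per_class
    return [np.vstack(feats_a), np.vstack(feats_b)], np.array(labels)

train_feats, y_train = make_split(20)
test_feats, y_test = make_split(5)

# Step 1: learn the fusion matrix and fuse the training features.
W = learn_fusion_matrix(train_feats, dim=2)
fused_train = np.hstack(train_feats) @ W

# Step 2: build class-specific dictionaries on the fused features, then classify.
dictionaries = learn_class_dictionaries(fused_train, y_train, n_atoms=1)
fused_test = np.hstack(test_feats) @ W
preds = [classify(x, dictionaries) for x in fused_test]
```

The second strategy would replace these two independent steps with an alternating update of the fusion matrix and the dictionaries, plus a shared dictionary for the common components; that joint optimization is not sketched here because the abstract gives no objective function for it.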

A Joint Evaluation of Dictionary Learning and Feature Encoding for Action Recognition

Integration of multi-feature fusion and dictionary learning for face recognition

Learning Individual-Specific Dictionaries With Fused Multiple Features For Face Recognition

Action Recognition with Stacked Fisher Vectors

Encoding Learning Network Combined with Feature Similarity Constraints for Human Action Recognition

Joint Feature Optimization and Fusion for Compressed Action Recognition

A Compact Representation of Human Actions by Sliding Coordinate Coding

Action-Stage Emphasized Spatiotemporal VLAD for Video Action Recognition

Constructing Visual Vocabularies Using Sparse Coding for Action Recognition

Hyper-Fisher Vectors for Action Recognition

Towards Good Practices for Action Video Encoding

Embedding Motion and Structure Features for Action Recognition

Learning Comprehensive Motion Representation for Action Recognition

Hierarchical Dynamic Parsing And Encoding For Action Recognition

Human Action Recognition via Multi-View Learning

DA-VLAD: Discriminative Action Vector of Locally Aggregated Descriptors for Action Recognition

Discriminative Multi-View Subspace Feature Learning for Action Recognition

Action Recognition By Learning Deep Multi-Granular Spatio-Temporal Video Representation

An Improved Action Recognition Network With Temporal Extraction and Feature Enhancement

Learning Hierarchical Video Representation for Action Recognition

Unsupervised Hierarchical Dynamic Parsing and Encoding for Action Recognition