Abstract: Recent research places growing emphasis on analyzing multiple features to improve face recognition (FR) performance. One popular scheme extends the sparse representation based classification framework with various sparse constraints. Although these methods study multiple features jointly through the constraints, they process each feature individually and thus overlook possible high-level relationships among different features. It is reasonable to assume that low-level features of facial images, such as edge information and smoothed/low-frequency images, can be fused into a more compact and more discriminative representation based on their latent high-level relationship. FR on the fused features is expected to outperform FR on the original features, since the fused features have more favorable properties. Motivated by this, we propose two strategies that first fuse multiple features and then exploit the dictionary learning (DL) framework for better FR performance. The first strategy is a simple and efficient two-step model, which learns a fusion matrix from training face images to fuse multiple features and then learns class-specific dictionaries based on the fused features. The second is a more effective but more computationally expensive model that learns the fusion matrix and the class-specific dictionaries simultaneously within an iterative optimization procedure. In addition, the second model separates the shared common components from the class-specific dictionaries to enhance their discriminative power.
The proposed strategies, which integrate multi-feature fusion and dictionary learning for FR, achieve the following goals: (1) exploiting multiple features of face images for better FR performance; (2) learning a fusion matrix to merge the features into a more compact and more discriminative representation; (3) learning class-specific dictionaries that account for common patterns for better classification performance. We perform a series of experiments on publicly available databases to evaluate our methods, and the experimental results demonstrate the effectiveness of the proposed models.
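To make the two-step strategy concrete, the following is a minimal sketch of its overall shape: fuse the features with a matrix, build one dictionary per class on the fused features, and classify by reconstruction residual. It is not the optimization proposed in the paper. As labeled assumptions, a truncated SVD of the concatenated features stands in for the learned fusion matrix, a per-class orthonormal basis stands in for the learned class-specific dictionaries, and all function names and the toy data are hypothetical.

```python
import numpy as np

def learn_fusion_matrix(feature_sets, fused_dim):
    """Stand-in for the learned fusion matrix: top right-singular vectors
    of the concatenated features (an assumption, not the paper's method)."""
    stacked = np.hstack(feature_sets)              # (n_samples, total_dim)
    _, _, vt = np.linalg.svd(stacked, full_matrices=False)
    return vt[:fused_dim].T                        # (total_dim, fused_dim)

def fuse(feature_sets, fusion_matrix):
    """Map the concatenated low-level features to the compact fused space."""
    return np.hstack(feature_sets) @ fusion_matrix

def learn_class_dictionaries(fused, labels, n_atoms):
    """Stand-in for dictionary learning: one orthonormal basis per class."""
    dictionaries = {}
    for c in np.unique(labels):
        class_samples = fused[labels == c].T       # (fused_dim, n_class_samples)
        u, _, _ = np.linalg.svd(class_samples, full_matrices=False)
        dictionaries[int(c)] = u[:, :n_atoms]      # keep the leading atoms
    return dictionaries

def classify(fused_sample, dictionaries):
    """Assign the class whose dictionary reconstructs the sample best."""
    residuals = {
        c: np.linalg.norm(fused_sample - d @ (d.T @ fused_sample))
        for c, d in dictionaries.items()
    }
    return min(residuals, key=residuals.get)

def make_toy_data(rng, n_per_class):
    """Synthetic two-class data with two feature types (purely illustrative)."""
    edge, lowfreq, labels = [], [], []
    for c in (0, 1):
        scale = rng.uniform(1.0, 3.0, size=(n_per_class, 1))
        direction = np.zeros(4)
        direction[c] = 1.0                         # each class has its own direction
        edge.append(scale * direction + 0.01 * rng.standard_normal((n_per_class, 4)))
        lowfreq.append(scale * direction + 0.01 * rng.standard_normal((n_per_class, 4)))
        labels += [c] * n_per_class
    return [np.vstack(edge), np.vstack(lowfreq)], np.array(labels)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    train_feats, train_labels = make_toy_data(rng, 20)
    W = learn_fusion_matrix(train_feats, fused_dim=2)
    dicts = learn_class_dictionaries(fuse(train_feats, W), train_labels, n_atoms=1)
    test_feats, test_labels = make_toy_data(rng, 5)
    preds = [classify(z, dicts) for z in fuse(test_feats, W)]
    print("accuracy:", np.mean(np.array(preds) == test_labels))
```

The second strategy would replace the two separate fitting calls with an alternating loop that updates the fusion matrix and the dictionaries together, and would add a shared dictionary for the common components. That joint procedure is not sketched here because the abstract does not state its objective.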

Integration of multi-feature fusion and dictionary learning for face recognition
