Abstract:Near-Infrared and Visible light (NIR-VIS) face recognition methods have achieved remarkable success in the fields of security surveillance, criminal investigation, and multimedia information retrieval. But the existing methods heavily rely on carefully annotated labels, leading to expensive manual labelling consumption and deployment flexibility. This motivates us to design unsupervised methods to address NIR-VIS recognition without relying on label information. To this end, we propose a novel homogeneous-to-HEterogeneous learning and Residual-invariant Enhancement (HERE) network for Unsupervised NIR-VIS Heterogeneous Face Recognition (NIR-VIS-UHFR). As the name suggests, the optimization of HERE follow a ”homogeneous-to-heterogeneous learning” strategy to fully explore complementary and common semantic information across different modalities. During the homogeneous learning phase, Modality-Adversarial Contrastive Learning (MACL) leverages the collaboration of modality contrastive learning and adversarial learning. On the one hand, MACL learns compact and discriminative intra-modal representations for NIR and VIS data, respectively. On the other hand, MACL guarantees that NIR-VIS data conform to the common feature distribution in a shared feature space, effectively reducing modal differences even in the absence of identity information between modalities. In the heterogeneous learning phase, K-reciprocal-Encoding-based Cross-modal Labeling (KECL) is introduced as robust pseudo label estimation to fully explore cross-modal relationships and group cross-modal features into clusters. With the pseudo labels provided by KECL, Refined cross-modal Contrastive Learning (RCL) is developed with modality-invariant averaging initialization and dynamic focus weighting strategies to extract modality-invariant features. Finally, Residual-invariant Representations Enhancement (RRE) mines partial features under the cross-modal face for robust matching. Compared to supervised methods, our unsupervised HERE demonstrates comparable performance on multiple datasets, greater scalability and practicality in deployment by reducing data acquisition requirements and costs.

Heterogeneous Face Recognition Via Grassmannian Based Nearest Subspace Search

Heterogeneous Face Recognition and Synthesis Using Canonical Correlation Analysis

Heterogeneous face biometrics based on Guassian weights and invariant features synthesis

Cross-spectral Face Completion for NIR-VIS Heterogeneous Face Recognition

Near infrared face recognition using Zernike moments and Hermite kernels

Shared Representation Learning for Heterogeneous Face Recognition

An Analysis-by-synthesis Method Based on Sparse Representation for Heterogeneous Face Biometrics

Adversarial Cross-Spectral Face Completion for NIR-VIS Face Recognition

Graphical Representation for Heterogeneous Face Recognition

Partial Face Matching between Near Infrared and Visual Images in MBGC Portal Challenge

Mutual Component Analysis for Heterogeneous Face Recognition.

Adversarial Discriminative Heterogeneous Face Recognition.

Unsupervised NIR-VIS Face Recognition Via Homogeneous-to-Heterogeneous Learning and Residual-Invariant Enhancement

Transductive VIS-NIR face matching

Sparse Graphical Representation Based Discriminant Analysis for Heterogeneous Face Recognition.

2D-3D Heterogeneous Face Recognition Based on Deep Coupled Spectral Regression

Matching NIR Face to VIS Face Using Transduction

Heterogeneous Face Recognition Based on Modality-Independent Kernel Fisher Discriminant Analysis Joint Sparse Auto-Encoder

Heterogeneous Face Recognition: A Common Encoding Feature Discriminant Approach

Re-Ranking High-Dimensional Deep Local Representation for NIR-VIS Face Recognition.

Near-infrared and visible light face recognition: a comprehensive survey