Abstract:Visible-infrared person re-identification (VI-ReID) aims to identify the same person across visible and infrared images. Its main challenge is how to extract modality-irrelevant person identity information. To alleviate cross-modality discrepancies, existing methods typically follow two paradigms: 1) Transform visible images into gray-scale color space and map them into the infrared domain. 2) Stack infrared images into RGB color space and map them into the visible domain. However, limited by different optical properties of visible and infrared waves, such mapping commonly leads to information asymmetry. Although some efforts prevent such discrepancies by data-level alignment, they typically meanwhile introduce misleading information and bring extra divergence. Therefore, existing methods fail on effectively eliminating the modality discrepancies. In this paper, we first analyze the essential factors to the generation of modality discrepancies. Secondly, we propose a novel Dual Modality-aware Alignment (DMA) model for VI-ReID, which can preserve discriminative identity information and suppress the misleading information within a uniform scheme. Particularly, based on the intrinsic optical properties of both modalities, a Dual Modality Transfer (DMT) module is proposed to perform compensation for the information asymmetry in HSV color space, thereby effectively alleviating cross-modality discrepancies and better preserving discriminative identity features. Further, an Intra-local Alignment (IA) module is proposed to suppress the misleading information, where a fine-grained local consistency objective function is designed to achieve more compact intra-class representations. Extensive experiments on several benchmark datasets demonstrate the effectiveness of our method and competitive performance with state-of-the-art methods. The source code of this paper is available at https://github.com/PKU-ICST-MIPL/DMA_TIFS2023.

Dual Pseudo-Labels Interactive Self-Training for Semi-Supervised Visible-Infrared Person Re-Identification

Modality-transfer Generative Adversarial Network and Dual-Level Unified Latent Representation for Visible Thermal Person Re-Identification

Robust Pseudo-label Learning with Neighbor Relation for Unsupervised Visible-Infrared Person Re-Identification

Semi-Supervised Learning With Heterogeneous Distribution Consistency for Visible Infrared Person Re-Identification

Inter-Intra Modality Knowledge Learning and Clustering Noise Alleviation for Unsupervised Visible-Infrared Person Re-Identification

Dual Knowledge Distillation on Multiview Pseudo Labels for Unsupervised Person Re-Identification

Unsupervised Visible-Infrared Person ReID by Collaborative Learning with Neighbor-Guided Label Refinement

Unsupervised Visible-Infrared ReID via Pseudo-label Correction and Modality-level Alignment

Cooperative Separation of Modality Shared-Specific Features for Visible-Infrared Person Re-Identification

DMA: Dual Modality-Aware Alignment for Visible-Infrared Person Re-Identification

Video-based Person Re-Identification by Semi-Supervised Adaptive Stepwise Learning

Cross-modality Hierarchical Clustering and Refinement for Unsupervised Visible-Infrared Person Re-Identification

Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification

MIMR: Modality-Invariance Modeling and Refinement for unsupervised visible-infrared person re-identification

Unified pre-training with pseudo infrared images for visible-infrared person re-identification

Dual adaptive alignment and partitioning network for visible and infrared cross-modality person re-identification

Co-segmentation assisted cross-modality person re-identification

Progressive Discriminative Feature Learning for Visible-Infrared Person Re-Identification

Joint Color-irrelevant Consistency Learning and Identity-aware Modality Adaptation for Visible-infrared Cross Modality Person Re-identification.

Translation, Association and Augmentation: Learning Cross-Modality Re-Identification From Single-Modality Annotation

Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification