Abstract:Visible-Infrared Person Re-identification (VI-ReID) is a challenging cross-modal pedestrian retrieval task, due to significant intra-class variations and cross-modal discrepancies among different cameras. Existing works mainly focus on embedding images of different modalities into a unified space to mine modality-shared features. They only seek distinctive information within these shared features, while ignoring the identity-aware useful information that is implicit in the modality-specific features. To address this issue, we propose a novel Implicit Discriminative Knowledge Learning (IDKL) network to uncover and leverage the implicit discriminative information contained within the modality-specific. First, we extract modality-specific and modality-shared features using a novel dual-stream network. Then, the modality-specific features undergo purification to reduce their modality style discrepancies while preserving identity-aware discriminative knowledge. Subsequently, this kind of implicit knowledge is distilled into the modality-shared feature to enhance its distinctiveness. Finally, an alignment loss is proposed to minimize modality discrepancy on enhanced modality-shared features. Extensive experiments on multiple public datasets demonstrate the superiority of IDKL network over the state-of-the-art methods. Code is available at

What problem does this paper attempt to address?

The problem that this paper attempts to solve is that in the visible - light and infrared image cross - modal person re - identification (VI - ReID) task, due to the significant differences between different modalities and the large variations within the same category, existing methods ignore the implicit discriminative information in modality - specific features when extracting modality - shared features. Specifically, existing VI - ReID methods mainly focus on embedding images of different modalities into a unified space to mine modality - shared features. However, these methods only seek the significant information in these shared features and ignore the useful identity - aware information implicit in modality - specific features. This limits the upper limit of the discriminative ability of feature representations. To address this problem, the authors propose a novel Implicit Discriminative Knowledge Learning (IDKL) network, aiming to reveal and utilize the implicit discriminative information contained in modality - specific features. Through this method, the IDKL network can enhance the saliency of modality - shared features, thereby improving the performance of the VI - ReID task. Specific technical contributions include: 1. **Proposing the IDKL network**: Using the discriminative knowledge in modality - specific features to enhance the discriminative ability of modality - shared features. 2. **Designing the IN - guided Information Purifier (IP)**: Reducing the modality - style differences while retaining the discriminative knowledge in modality - specific information. 3. **Developing the TGSA loss**: Distilling discriminative modality - specific information into modality - shared features at the feature level and fully reducing the inter - modality differences of modality - shared features. Through these technical means, the IDKL framework can improve the discriminative ability while maintaining the invariance of modality - shared features. Thus, experimental results on multiple public datasets show that this method outperforms existing state - of - the - art methods.

Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification

Knowledge self-distillation for visible-infrared cross-modality person re-identification

Progressive Discriminative Feature Learning for Visible-Infrared Person Re-Identification

Stronger Heterogeneous Feature Learning for Visible-Infrared Person Re-Identification

Implicit Modality Knowledge Alignment and Uncertainty Estimation for visible-infrared person re-identification

Dual adaptive alignment and partitioning network for visible and infrared cross-modality person re-identification

Joint Color-irrelevant Consistency Learning and Identity-aware Modality Adaptation for Visible-infrared Cross Modality Person Re-identification.

Modality Bias Calibration Network Via Information Disentanglement for Visible–Infrared Person Reidentification

Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification

Frequency Domain Modality-invariant Feature Learning for Visible-infrared Person Re-Identification

DMA: Dual Modality-Aware Alignment for Visible-Infrared Person Re-Identification

Hi-CMD: Hierarchical Cross-Modality Disentanglement for Visible-Infrared Person Re-Identification

Cooperative Separation of Modality Shared-Specific Features for Visible-Infrared Person Re-Identification

Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification.

Dynamic Identity-Guided Attention Network for Visible-Infrared Person Re-identification

Unbiased Feature Learning with Causal Intervention for Visible-Infrared Person Re-identification

Co-segmentation assisted cross-modality person re-identification

Inter-Intra Modality Knowledge Learning and Clustering Noise Alleviation for Unsupervised Visible-Infrared Person Re-Identification

Cross-modality disentanglement and shared feedback learning for infrared-visible person re-identification

Adaptive Middle Modality Alignment Learning for Visible-Infrared Person Re-identification

Visible-Infrared Person Re-Identification Based on Frequency-Domain Simulated Multispectral Modality for Dual-Mode Cameras