Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification

Kaijie Ren,Lei Zhang
2024-03-26
Abstract:Visible-Infrared Person Re-identification (VI-ReID) is a challenging cross-modal pedestrian retrieval task, due to significant intra-class variations and cross-modal discrepancies among different cameras. Existing works mainly focus on embedding images of different modalities into a unified space to mine modality-shared features. They only seek distinctive information within these shared features, while ignoring the identity-aware useful information that is implicit in the modality-specific features. To address this issue, we propose a novel Implicit Discriminative Knowledge Learning (IDKL) network to uncover and leverage the implicit discriminative information contained within the modality-specific. First, we extract modality-specific and modality-shared features using a novel dual-stream network. Then, the modality-specific features undergo purification to reduce their modality style discrepancies while preserving identity-aware discriminative knowledge. Subsequently, this kind of implicit knowledge is distilled into the modality-shared feature to enhance its distinctiveness. Finally, an alignment loss is proposed to minimize modality discrepancy on enhanced modality-shared features. Extensive experiments on multiple public datasets demonstrate the superiority of IDKL network over the state-of-the-art methods. Code is available at
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in the visible - light and infrared image cross - modal person re - identification (VI - ReID) task, due to the significant differences between different modalities and the large variations within the same category, existing methods ignore the implicit discriminative information in modality - specific features when extracting modality - shared features. Specifically, existing VI - ReID methods mainly focus on embedding images of different modalities into a unified space to mine modality - shared features. However, these methods only seek the significant information in these shared features and ignore the useful identity - aware information implicit in modality - specific features. This limits the upper limit of the discriminative ability of feature representations. To address this problem, the authors propose a novel Implicit Discriminative Knowledge Learning (IDKL) network, aiming to reveal and utilize the implicit discriminative information contained in modality - specific features. Through this method, the IDKL network can enhance the saliency of modality - shared features, thereby improving the performance of the VI - ReID task. Specific technical contributions include: 1. **Proposing the IDKL network**: Using the discriminative knowledge in modality - specific features to enhance the discriminative ability of modality - shared features. 2. **Designing the IN - guided Information Purifier (IP)**: Reducing the modality - style differences while retaining the discriminative knowledge in modality - specific information. 3. **Developing the TGSA loss**: Distilling discriminative modality - specific information into modality - shared features at the feature level and fully reducing the inter - modality differences of modality - shared features. Through these technical means, the IDKL framework can improve the discriminative ability while maintaining the invariance of modality - shared features. Thus, experimental results on multiple public datasets show that this method outperforms existing state - of - the - art methods.