Visible-Infrared Person Re-Identification Via Patch-Mixed Cross-Modality Learning.

Zhihao Qian, Yutian Lin, Bo Du
DOI: https://doi.org/10.1016/j.patcog.2024.110873
IF: 8
2025-01-01
Pattern Recognition
Abstract: Visible-infrared person re-identification (VI-ReID) aims to retrieve images of the same pedestrian from different modalities, where the challenges lie in the significant modality discrepancy. To alleviate the modality gap, recent methods generate intermediate images by GANs, grayscaling, or mixup strategies. However, these methods could introduce extra data distribution, and the semantic correspondence between the two modalities is not well learned. In this paper, we propose a Patch-Mixed Cross-Modality framework (PMCM), where two images of the same person from two modalities are split into patches and stitched into a new one for model learning. A part-alignment loss is introduced to regularize representation learning, and a patch-mixed modality learning loss is proposed to align between the modalities. In this way, the model learns to recognize a person through patches of different styles, thereby the modality semantic correspondence can be inferred. In addition, with the flexible image generation strategy, the patch-mixed images freely adjust the ratio of different modality patches, which could further alleviate the modality imbalance problem. On two VI-ReID datasets, we report new state-of-the-art performance with the proposed method.
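The patch-mixing step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The function name `patch_mix`, the parameters `patch_size` and `infrared_ratio`, and the uniform random choice of which patches come from the infrared image are all assumptions made for illustration; the abstract does not specify the patch size or the selection strategy. The sketch also assumes both images are already resized to the same shape, and it omits the part-alignment loss and the patch-mixed modality learning loss.

```python
import numpy as np


def patch_mix(visible, infrared, patch_size=16, infrared_ratio=0.5, rng=None):
    """Stitch a new image from patches of a visible and an infrared image.

    Hypothetical helper: random patch selection is an assumption, not a
    detail taken from the paper. Returns the mixed image and a boolean
    grid marking which patches were taken from the infrared image.
    """
    if visible.shape != infrared.shape:
        raise ValueError("both images must have the same shape")
    height, width = visible.shape[:2]
    if height % patch_size or width % patch_size:
        raise ValueError("image sides must be multiples of patch_size")
    if not 0.0 <= infrared_ratio <= 1.0:
        raise ValueError("infrared_ratio must lie in [0, 1]")

    rng = np.random.default_rng() if rng is None else rng
    rows, cols = height // patch_size, width // patch_size
    num_patches = rows * cols

    # The ratio of infrared patches is freely adjustable, as in the abstract.
    num_infrared = int(round(infrared_ratio * num_patches))
    chosen = rng.choice(num_patches, size=num_infrared, replace=False)
    flat_mask = np.zeros(num_patches, dtype=bool)
    flat_mask[chosen] = True
    patch_mask = flat_mask.reshape(rows, cols)

    # Expand the patch-level mask to pixel resolution.
    pixel_mask = np.repeat(np.repeat(patch_mask, patch_size, axis=0),
                           patch_size, axis=1)
    if visible.ndim == 3:
        pixel_mask = pixel_mask[..., None]  # broadcast over channels

    mixed = np.where(pixel_mask, infrared, visible)
    return mixed, patch_mask
```

Raising `infrared_ratio` toward 1 yields images dominated by infrared patches, and lowering it toward 0 yields mostly visible patches, which is one way the ratio of modality patches could be varied during training.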