GLSFF: Global–local Specific Feature Fusion for Cross-Modality Pedestrian Re-Identification

Chen Xue,Zhongliang Deng,Shuo Wang,Enwen Hu,Yao Zhang,Wangwang Yang,Yiming Wang
DOI: https://doi.org/10.1016/j.comcom.2023.12.035
IF: 5.047
2024-01-01
Computer Communications
Abstract:Cross-modality Pedestrian Re-identification (Pedestrian ReID) is an image retrieval technique used to match targets of interest in a library. The main challenge of Pedestrian cm-ReID is that the modality gap between visible and infrared images reduces the recognition effect. To reduce the gap, we propose a novel Pedestrian cm-ReID model called Global–Local Specific Feature Fusion (GLSFF) to integrate the person features extracted by backbone network. It contains the Global Feature Fusion Module (GFFM) and the Local Feature Fusion Module (LFFM). LFFM and GFFM fuse the local specific and global shared features of pedestrians respectively. Fusion features mitigate the gap modality in images and retain discriminative information, such as gait and silhouette. In addition, we propose a joint training method, which combines Heterogeneous Center Loss (HC Loss), Triplet Loss and Cross-Entropy Loss (CE Loss) to accelerate the convergence rate of the model. Extensive experiments were conducted on two popular datasets, SYSU-MM01 and RegDB, and the mAP of GLSFF reached 64.47% and 81.37%, respectively.
What problem does this paper attempt to address?