A Spatial-Channel Multi-Attention Parallel Network for Visible-Infrared Person Re-identification

Xu Zhang,Zhongliang Deng,Yao Zhang
DOI: https://doi.org/10.1109/seai62072.2024.10674327
2024-01-01
Abstract:Visible-infrared person re-identification(VI-ReID) aims to narrow the image differences between different modalities to achieve cross-device, full-time retrieval of pedestrians. Due to the differences in imaging principles, VI-ReID needs to overcome significant modal differences. Therefore, this paper proposes an innovative multi-attention parallel network. The model extracts feature from the two dimensions of space-channel through the attention mechanism, and aggregates high- and low-level features. This makes the global features retain multilevel and multidimensional common features, and narrows the feature differences between modalities. Then, the global features are mined by block, and multiple spatial-channel local attention modules are used to simultaneously explore local features in different positions, mining specific features and improving the discrimination of features. At the same time, the aligned local features are used to narrow the feature dislocation caused by the change of person posture. Comprehensive experiments show that the proposed model performs well on the SYSU-MM01 dataset, with Rank-1 reaching 76.35% and map reaching 78.23%.
What problem does this paper attempt to address?