Abstract:Person re-identification (Re-ID) is an important problem in video surveillance for matching pedestrian images across non-overlapping camera views. Currently, most works focus on RGB-based Re-ID. However, RGB images are not well suited to a dark environment; consequently, infrared (IR) imaging becomes necessary for indoor scenes with low lighting and 24-h outdoor scene surveillance systems. In such scenarios, matching needs to be performed between RGB images and IR images, which exhibit different visual characteristics; this cross-modality matching problem is more challenging than RGB-based Re-ID due to the lack of visible colour information in IR images. To address this challenge, we study the RGB-IR cross-modality Re-ID (RGB-IR Re-ID) problem. Rather than applying existing cross-modality matching models that operate under the assumption of identical data distributions between training and testing sets to handle the discrepancy between RGB and IR modalities for Re-ID, we cast learning shared knowledge for cross-modality matching as the problem of cross-modality similarity preservation. We exploit same-modality similarity as the constraint to guide the learning of cross-modality similarity along with the alleviation of modality-specific information, and finally propose a Focal Modality-Aware Similarity-Preserving Loss. To further assist the feature extractor in extracting shared knowledge, we design a modality-gated node as a universal representation of both modality-specific and shared structures for constructing a structure-learnable feature extractor called Modality-Gated Extractor. For validation, we construct a new multi-modality Re-ID dataset, called SYSU-MM01, to enable wider study of this problem. Extensive experiments on this SYSU-MM01 dataset show the effectiveness of our method. Download link of dataset: <a href="https://github.com/wuancong/SYSU-MM01">https://github.com/wuancong/SYSU-MM01</a>.

Multi-scale feature correspondence and restriction mechanism for visible X-ray baggage re-Identification

Contribution-Based Multi-Stream Feature Distance Fusion Method with ${k}$ -Distribution Re-Ranking for Person Re-Identification

Contribution-Based Multi-Stream Feature Distance Fusion Method With <inline-formula> <tex-math notation="LaTeX">${k}$ </tex-math></inline-formula>-Distribution Re-Ranking for Person Re-Identification

Representation Selective Coupling Via Token Sparsification for Multi-Spectral Object Re-Identification

Feature separation and double causal comparison loss for visible and infrared person re-identification

Multi-scale Semantic Correlation Mining for Visible-Infrared Person Re-Identification

Cross-Modality Person Re-identification with Memory-Based Contrastive Embedding

Cooperative Separation of Modality Shared-Specific Features for Visible-Infrared Person Re-Identification

RGB-IR Person Re-identification by Cross-Modality Similarity Preservation

Joint Color-irrelevant Consistency Learning and Identity-aware Modality Adaptation for Visible-infrared Cross Modality Person Re-identification.

Visible-Infrared Person Re-Identification Based on Frequency-Domain Simulated Multispectral Modality for Dual-Mode Cameras

Bridging the Gap: Multi-level Cross-modality Joint Alignment for Visible-infrared Person Re-identification

Co-segmentation assisted cross-modality person re-identification

Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification.

Cross-modal Local Shortest Path and Global Enhancement for Visible-Thermal Person Re-Identification

Cross-modality disentanglement and shared feedback learning for infrared-visible person re-identification

Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification

Cross Vision-RF Gait Re-identification with Low-cost RGB-D Cameras and mmWave Radars

Cross-Modality Spatial-Temporal Transformer for Video-Based Visible-Infrared Person Re-Identification

Cross-Spectrum Dual-Subspace Pairing for RGB-infrared Cross-Modality Person Re-Identification

Efficient Bilateral Cross-Modality Cluster Matching for Unsupervised Visible-Infrared Person ReID