Abstract: Visible-infrared person re-identification has attracted extensive attention because of its promising applications in video surveillance. Visible and infrared images are produced by different imaging mechanisms, which leads to large modality discrepancies between them. Existing studies alleviate these discrepancies by aligning the modality distributions or by extracting modality-shared features from the original images. However, they overlook a simple solution: converting visible images directly to gray images, which reduces the modality discrepancy efficiently and effectively. In this paper, we transform the cross-modality person re-identification task from visible-infrared matching to gray-infrared matching, a setting we call the minimal modality discrepancy. In addition, we propose a pyramid feature integration network (PFINet), which mines discriminative, refined features from pedestrian images and fuses high-level, semantically strong features to build a robust pedestrian representation. Specifically, PFINet first performs feature extraction from concrete to abstract, followed by top-down semantic transfer, to obtain multi-scale feature maps. Second, the multi-scale feature maps are fed into the discriminative-region response module, which uses a spatial attention mechanism to emphasize identity-discriminative regions. Finally, the pedestrian representation is obtained by feature integration. Extensive experiments demonstrate the effectiveness of PFINet, which achieves a rank-1 accuracy of 81.95% and an mAP of 74.49% in the multi-all evaluation mode of the SYSU-MM01 dataset.
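The visible-to-gray conversion described in the abstract can be illustrated with a minimal sketch. The abstract does not say which conversion formula is used, so the ITU-R BT.601 luma weights below are an assumption, as are the function names `rgb_to_gray` and `to_gray_three_channel` and the choice to replicate the gray value across three channels so that the output keeps the shape of an RGB input. Plain Python lists are used only to keep the example dependency-free.

```python
# Luma weights from ITU-R BT.601. This is an assumption: the abstract
# does not state which grayscale formula the authors use.
BT601_WEIGHTS = (0.299, 0.587, 0.114)


def rgb_to_gray(pixel):
    """Map one (r, g, b) pixel with 0-255 values to a single gray level."""
    r, g, b = pixel
    wr, wg, wb = BT601_WEIGHTS
    return round(wr * r + wg * g + wb * b)


def to_gray_three_channel(image):
    """Convert an image given as rows of (r, g, b) tuples to gray.

    The gray level is copied into all three channels (hypothetical
    design choice) so the result has the same layout as the input.
    """
    gray_image = []
    for row in image:
        gray_row = []
        for pixel in row:
            level = rgb_to_gray(pixel)
            gray_row.append((level, level, level))
        gray_image.append(gray_row)
    return gray_image


if __name__ == "__main__":
    visible = [
        [(255, 0, 0), (0, 255, 0)],
        [(0, 0, 255), (255, 255, 255)],
    ]
    for row in to_gray_three_channel(visible):
        print(row)
```

After this step the color information that infrared images lack is gone from the visible branch, which is the intuition behind the gray-infrared setting; how the paper feeds the gray images into PFINet is not covered by this sketch.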

Pose-Invariant Embedding for Deep Person Re-Identification

A Novel Two-Stream Saliency Image Fusion CNN Architecture for Person Re-Identification

Re-Identifying Pedestrians Via Part Based Method

Deep Siamese Network with Multi-level Similarity Perception for Person Re-identification

Person Re-Identification with Effectively Designed Parts

Pedestrian Alignment Network for Large-scale Person Re-Identification

Deep-Person: Learning discriminative deep features for person Re-Identification

Pose-Guided Representation Learning for Person Re-Identification

A divide-and-unite deep network for person re-identification

Learn Robust Pedestrian Representation Within Minimal Modality Discrepancy for Visible-Infrared Person Re-Identification

Pedestrian Re-ID based on feature consistency and contrast enhancement

Pose-driven Deep Convolutional Model for Person Re-identification

Pose-Aided Video-based Person Re-Identification via Recurrent Graph Convolutional Network

Part-Weighted Deep Representation Learning for Person Re-Identification

An Adaptive Part-Based Model for Person Re-Identification

Pose-Guided Feature Alignment for Occluded Person Re-Identification

An End-to-End Foreground-Aware Network for Person Re-Identification

Enhance Part-Based Model for Person Re-Identification with Fused Multi-Scale Features

Person Re-Identification Network Based on Edge-Enhanced Feature Extraction and Inter-Part Relationship Modeling

Deeply-Learned Part-Aligned Representations for Person Re-identification

Attribute-Guided Collaborative Learning for Partial Person Re-Identification