Abstract:Person re-identification (Re-ID) is a challenging task in the field of computer vision and focuses on matching people across images from different cameras. The extraction of robust feature representations from pedestrian images through CNNs with a single deterministic pooling operation is problematic as the features in real pedestrian images are complex and diverse. To address this problem, we propose a novel center-triplet (CT) model that combines the learning of robust feature representation and the optimization of metric loss function. Firstly, we design a fusion feature learning network (FFLN) with a novel fusion strategy consisting of max pooling and average pooling. Instead of adopting a single deterministic pooling operation, the FFLN combines two pooling operations that can learn high response values, bright features, and low response values, discriminative features simultaneously. Our model obtains more discriminative fusion features by adaptively learning the weights of the features learned by the corresponding pooling operations. In addition, we design a hard mining center-triplet loss (HCTL), a novel improved triplet loss, which effectively optimizes the intra/inter-class distance and reduces the cost of computing and mining hard training samples simultaneously, thereby enhancing the learning of robust feature representation. Finally, we proved our method can learn robust and discriminative feature representations for complex pedestrian images in real scenes. The experimental results also illustrate that our method achieves an 81.8% mAP and a 93.8% rank-1 accuracy on Market1501, a 68.2% mAP and an 83.3% rank-1 accuracy on DukeMTMC-ReID, and a 43.6% mAP and a 74.3% rank-1 accuracy on MSMT17, outperforming most state-of-the-art methods and achieving better performance for person re-identification.

Fine-Grained Spatial Alignment Model for Person Re-Identification with Focal Triplet Loss.

A Loss Combination Based Deep Model for Person Re-Identification

Joining Features by Global Guidance with Bi-Relevance Trihard Loss for Person Re-Identification

Joint Uneven Channel Information Network with Blend Metric Loss for Person Re-Identification

Gaussian-based Probability Fusion for Person Re-Identification with Taylor Angular Margin Loss

Densely Semantically Aligned Person Re-Identification

Attend and Align: Improving Deep Representations with Feature Alignment Layer for Person Retrieval.

Discriminative Feature Learning with Foreground Attention for Person Re-identification.

Learning to Align Via Wasserstein for Person Re-Identification

Person Re-Identification with Triplet Focal Loss

Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search

Deeply-Learned Part-Aligned Representations for Person Re-identification.

Strong Feature Fusion Networks for Person Re-Identification

Pose-Guided Feature Alignment for Occluded Person Re-Identification

Foreground-guided textural-focused person re-identification

Concentrated Local Part Discovery with Fine-Grained Part Representation for Person Re-Identification.

Deep Fusion Feature Representation Learning With Hard Mining Center-Triplet Loss for Person Re-Identification

Semantics-Aligned Representation Learning for Person Re-Identification

Learning refined attribute-aligned network with attribute selection for person re-identification

Focus on the Visible Regions: Semantic-Guided Alignment Model for Occluded Person Re-Identification.

Recurrent matching networks of spatial alignment learning for person re-identification