Abstract:Person re-identification (Re-ID) is a challenging task in the field of computer vision and focuses on matching people across images from different cameras. The extraction of robust feature representations from pedestrian images through CNNs with a single deterministic pooling operation is problematic as the features in real pedestrian images are complex and diverse. To address this problem, we propose a novel center-triplet (CT) model that combines the learning of robust feature representation and the optimization of metric loss function. Firstly, we design a fusion feature learning network (FFLN) with a novel fusion strategy consisting of max pooling and average pooling. Instead of adopting a single deterministic pooling operation, the FFLN combines two pooling operations that can learn high response values, bright features, and low response values, discriminative features simultaneously. Our model obtains more discriminative fusion features by adaptively learning the weights of the features learned by the corresponding pooling operations. In addition, we design a hard mining center-triplet loss (HCTL), a novel improved triplet loss, which effectively optimizes the intra/inter-class distance and reduces the cost of computing and mining hard training samples simultaneously, thereby enhancing the learning of robust feature representation. Finally, we proved our method can learn robust and discriminative feature representations for complex pedestrian images in real scenes. The experimental results also illustrate that our method achieves an 81.8% mAP and a 93.8% rank-1 accuracy on Market1501, a 68.2% mAP and an 83.3% rank-1 accuracy on DukeMTMC-ReID, and a 43.6% mAP and a 74.3% rank-1 accuracy on MSMT17, outperforming most state-of-the-art methods and achieving better performance for person re-identification.

Attend and Align: Improving Deep Representations with Feature Alignment Layer for Person Retrieval.

A Loss Combination Based Deep Model for Person Re-Identification

Joining Features by Global Guidance with Bi-Relevance Trihard Loss for Person Re-Identification

Fine-Grained Spatial Alignment Model for Person Re-Identification with Focal Triplet Loss.

Discriminative Feature Learning with Foreground Attention for Person Re-identification.

Deeply-Learned Part-Aligned Representations for Person Re-identification.

Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search

Discriminative Feature Learning with Consistent Attention Regularization for Person Re-Identification

Deep-Person: Learning discriminative deep features for person Re-Identification

Combining Multilevel Feature Extraction and Multi-Loss Learning for Person Re-Identification

Progressive Feature Alignment for Occluded Person Re-Identification

Learning refined attribute-aligned network with attribute selection for person re-identification

Deep Fusion Feature Representation Learning With Hard Mining Center-Triplet Loss for Person Re-Identification

Strong Feature Fusion Networks for Person Re-Identification

Multi-level Feature Learning with Attention for Person Re-Identification.

Densely Semantically Aligned Person Re-Identification

DeepList: Learning Deep Features With Adaptive Listwise Constraint for Person Reidentification.

Adaptive Re-ranking of Deep Feature for Person Re-identification

Learning Concordant Attention Via Target-aware Alignment for Visible-Infrared Person Re-identification

Information complementary attention-based multidimension feature learning for person re-identification

Harmonious Attention Network for Person Re-Identification