Abstract: As a sub-direction of image retrieval, person re-identification (Re-ID) is commonly used for cross-camera tracking and monitoring in security applications. A growing number of shopping centers have recently attempted to apply Re-ID technology. One development trend in related algorithms is the use of attention mechanisms to capture global and local features. These algorithms have an apparent limitation: they focus only on the most salient features and neglect certain detailed ones. People's clothes, bags and even shoes are of great help in distinguishing pedestrians, yet global features tend to mask these important local features. Therefore, we propose a dual-branch network based on a multi-scale attention mechanism that captures both the apparent global features and the inconspicuous local features of pedestrian images. Specifically, we design a dual-branch attention network (DBA-Net) whose two branches simultaneously optimize features extracted at different depths. We also design an attention block, channel, position and spatial-wise attention (CPSA), that captures key fine-grained information such as bags and shoes. Furthermore, building on the ID loss, we apply a complementary triplet loss and an adaptive weighted rank list loss (WRLL) to each branch during training. DBA-Net not only learns semantic context information along the channel, position, and spatial dimensions but also integrates detailed semantic information by learning the dependency relationships between features. Extensive experiments on three widely used open-source datasets show that DBA-Net achieves state-of-the-art performance overall. In particular, on the CUHK03 dataset, DBA-Net achieves a mean average precision (mAP) of 83.2%.
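
For readers unfamiliar with the mAP metric reported above, the following is a minimal sketch of how mean average precision is commonly computed for retrieval tasks. It is not the authors' evaluation code: the function names and the boolean-list input format are assumptions made for illustration, and benchmark protocols typically add filtering steps (such as excluding gallery images from the same camera as the query) that are omitted here.

```python
def average_precision(ranked_matches):
    """Average precision for one query.

    ranked_matches: list of bools, one per gallery image, ordered from most
    to least similar to the query; True marks an image of the same identity.
    (Hypothetical input format, chosen for illustration.)
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_match in enumerate(ranked_matches, start=1):
        if is_match:
            hits += 1
            # Precision at this rank: correct matches so far / images seen so far.
            precision_sum += hits / rank
    # A query with no true match in the gallery contributes 0 in this sketch.
    return precision_sum / hits if hits else 0.0


def mean_average_precision(all_ranked_matches):
    """Mean of the per-query average precisions."""
    aps = [average_precision(r) for r in all_ranked_matches]
    return sum(aps) / len(aps) if aps else 0.0
```

For example, a query whose true matches appear at ranks 1 and 3 has an average precision of (1/1 + 2/3) / 2, so rankings that place correct identities earlier score higher.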

CA3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-Identification

A Novel Two-Stream Saliency Image Fusion CNN Architecture for Person Re-Identification

Deep Siamese Network with Multi-level Similarity Perception for Person Re-identification

Temporal Attribute-Appearance Learning Network for Video-based Person Re-Identification

Dual Attention Matching Network for Context-Aware Feature Sequence based Person Re-Identification

Information complementary attention-based multidimension feature learning for person re-identification

Dense 3D-Convolutional Neural Network for Person Re-Identification in Videos

Attribute-Guided Global and Part-Level Identity Network for Person Re-Identification

Dual Branch Attention Network for Person Re-Identification

Dual-branch Self-Attention Network for Pedestrian Attribute Recognition

An End-to-End Foreground-Aware Network for Person Re-Identification

An efficient feature pyramid attention network for person re-identification

Deep-Person: Learning discriminative deep features for person Re-Identification

Person Re-Identification Based on Spatial Feature Learning and Multi-Granularity Feature Fusion

Triplet Attention Network for Video-Based Person Re-Identification

Improved Res2Net model for Person re-identification

Co-Saliency Spatio-Temporal Interaction Network for Person Re-Identification in Videos

Temporal-Contextual Attention Network for Video-Based Person Re-identification

Attribute-Guided Collaborative Learning for Partial Person Re-Identification

Adaptive Alignment Network for Person Re-identification

Person Re-Identification Network Based on Edge-Enhanced Feature Extraction and Inter-Part Relationship Modeling