Abstract:Person re-identification is still a challenging task when moving objects or another person occludes the probe person. Mainstream methods based on even partitioning apply an off-the-shelf human semantic parsing to highlight the non-collusion part. In this paper, we apply an attention branch to learn the human semantic partition to avoid misalignment introduced by even partitioning. In detail, we propose a semantic attention branch to learn 5 human semantic maps. We also note that some accessories or belongings, such as a hat, bag, may provide more informative clues to improve the person Re-ID. Human semantic parsing, however, usually treats non-human parts as distractions and discards them. To fetch the missing clues, we design a branch to capture the salient non-human parts. Finally, we merge the semantic and saliency attention to build an end-to-end network, named as S <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$^{2}$</tex-math></inline-formula> -Net. Specifically, to further improve Re-ID, we develop a trade-off weighting scheme between semantic and saliency attention and set the right weight with the actual scene. The extensive experiments show that S <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$^{2}$</tex-math></inline-formula> -Net gets the competitive performance. S <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$^{2}$</tex-math></inline-formula> -Net achieves 87.4% mAP on Market1501 and obtains 79.3%/56.1% rank-1/mAP on MSMT17 without semantic supervision. The source codes are available at <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/upgirlnana/S2Net</uri> .

S<SUP>2</SUP>-Net:Semantic and Saliency Attention Network for Person Re-Identification

S$^{2}$-Net:Semantic and Saliency Attention Network for Person Re-Identification.

S<inline-formula><tex-math notation="LaTeX">$^{2}$</tex-math></inline-formula>-Net:Semantic and Saliency Attention Network for Person Re-Identification

A Novel Two-Stream Saliency Image Fusion CNN Architecture for Person Re-Identification

Deep Siamese Network with Multi-level Similarity Perception for Person Re-identification

Person Re-identification Network Based on Multi-Level Feature Fusion

Improving the Accuracy of Person Re- Identification by Mining Semantic Features and Applying New Attention Mechanism

Dual Attention Matching Network for Context-Aware Feature Sequence based Person Re-Identification

Dual Branch Attention Network for Person Re-Identification

Semantic-Aware Occlusion-Robust Network for Occluded Person Re-Identification

A Supervisory Mask Attentional Network for Person Re-Identification in Uniform Dress Scenes.

Semantically enhanced attention map‐driven occluded person re‐identification

MIX-Net: Hybrid Attention/Diversity Network for Person Re-Identification

Semantics-Aligned Representation Learning for Person Re-Identification

Salience-Guided Cascaded Suppression Network for Person Re-identification

Person Re-Identification Based on Visual Saliency

VMRFANet:View-Specific Multi-Receptive Field Attention Network for Person Re-identification

Multi-level Similarity Perception Network for Person Re-identification

Improved Person Re-Identification Based on Saliency and Semantic Parsing with Deep Neural Network Models

Attention Driven Person Re-Identification.

Semantically Self-Aligned Network for Text-to-Image Part-aware Person Re-identification