Dual-branch Self-Attention Network for Pedestrian Attribute Recognition

Zhenyu Liu, Zhang, Da Li, Peng Zhang, Caifeng Shan
DOI: https://doi.org/10.1016/j.patrec.2022.10.003
IF: 4.757
2022-01-01
Pattern Recognition Letters
Abstract: Pedestrian attribute recognition (PAR) remains a challenging task in real surveillance scenes, where difficulties such as occlusion, complex backgrounds, and varying viewpoints degrade recognition accuracy. To fully exploit attribute correlation and regional context, we propose a dual-branch self-attention network for PAR: (1) In the attribute branch, a second-order self-attention module (SO-SAM) is first introduced to derive second-order feature maps; these are then fused with the first-order information to learn unique features for each attribute using a constrained loss function. (2) In the context branch, multiple adaptive visual tokens and a group of multi-head context self-attention modules (C-SAM) are used to describe the image and explore the relationships between different regions. Experimental results on three main public benchmarks (RAP, PA100K, and PETA) demonstrate the effectiveness of the proposed method. © 2022 Elsevier B.V. All rights reserved.