Diverse Part Attentive Network for Video-Based Person Re-Identification
Xiujun Shu, Ge Li, Longhui Wei, Jia-Xing Zhong, Xianghao Zang, Shiliang Zhang, Yaowei Wang, Yongsheng Liang, Qi Tian
DOI: https://doi.org/10.1016/j.patrec.2021.05.020
IF: 4.757
2021-01-01
Pattern Recognition Letters
Abstract: Attention mechanisms have achieved success in video-based person re-identification (re-ID). However, current global attention methods tend to focus on the most salient parts, e.g., clothes, and ignore subtler but valuable cues, e.g., hair, bags, and shoes, so they do not make full use of the information available from diverse parts of the human body. To tackle this issue, we propose a Diverse Part Attentive Network (DPAN) to exploit discriminative and diverse body cues. The framework consists of two modules: spatial diverse part attention and temporal diverse part attention. The spatial module uses channel grouping to exploit diverse parts of the human body, including both salient and subtle parts. The temporal module learns diverse weights for fusing the learned features. In addition, the framework is lightweight, introducing only marginal additional parameters and computational cost. Extensive experiments were conducted on three popular benchmarks, i.e., iLIDS-VID, PRID2011, and MARS. Our method achieves competitive performance on these datasets compared with state-of-the-art methods. (c) 2021 Elsevier B.V. All rights reserved.
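Illustrative sketch (not the authors' implementation): the abstract names two ideas, channel grouping for spatial attention and learned weights for temporal fusion, without giving their exact form. The pure-Python toy below shows one plausible reading under loud assumptions: channels are split into equal groups, each group's spatial attention map is a softmax over its mean activation, and each group's temporal weights are a softmax over its mean pooled response. The function names (`spatial_group_attention`, `temporal_group_attention`, `dpan_sketch`), the scoring rules, and the absence of learned parameters are all hypothetical simplifications.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def spatial_group_attention(frame, num_groups):
    """frame: list of C channels, each a list of N spatial activations.

    Assumption: each channel group gets its own attention map over the N
    positions, so different groups can attend to different body regions.
    Returns a length-C vector of attention-pooled channel responses.
    """
    num_channels = len(frame)
    assert num_channels % num_groups == 0, "channels must split evenly"
    size = num_channels // num_groups
    pooled = []
    for g in range(num_groups):
        group = frame[g * size:(g + 1) * size]
        n = len(group[0])
        # Group saliency: mean activation of the group's channels per position.
        saliency = [sum(ch[p] for ch in group) / size for p in range(n)]
        attn = softmax(saliency)
        # Pool every channel in the group with the group's attention map.
        for ch in group:
            pooled.append(sum(a * v for a, v in zip(attn, ch)))
    return pooled


def temporal_group_attention(frame_feats, num_groups):
    """frame_feats: list of T vectors of length C (one per frame).

    Assumption: each channel group gets its own softmax weights over the
    T frames, so groups can favour different frames when fusing.
    Returns a single length-C clip-level vector.
    """
    num_frames = len(frame_feats)
    size = len(frame_feats[0]) // num_groups
    fused = []
    for g in range(num_groups):
        lo, hi = g * size, (g + 1) * size
        # Hypothetical frame score: mean pooled response of the group.
        scores = [sum(f[lo:hi]) / size for f in frame_feats]
        weights = softmax(scores)
        for c in range(lo, hi):
            fused.append(sum(weights[t] * frame_feats[t][c]
                             for t in range(num_frames)))
    return fused


def dpan_sketch(clip, num_groups):
    """clip: T frames, each C channels x N positions. Returns a length-C vector."""
    per_frame = [spatial_group_attention(f, num_groups) for f in clip]
    return temporal_group_attention(per_frame, num_groups)
```

For example, calling `dpan_sketch(clip, num_groups=2)` on a clip of 2 frames with 4 channels and 3 spatial positions returns one 4-dimensional clip descriptor. Because both stages are convex combinations, each output value stays within the range of its channel's input activations. The actual DPAN modules are learned; this toy only conveys the grouping structure.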