AVPL: Augmented Visual Perception Learning for Person Re-identification and Beyond

Yewen Huang, Sicheng Lian, Haifeng Hu
DOI: https://doi.org/10.1016/j.patcog.2022.108736
IF: 8
2022-01-01
Pattern Recognition
Abstract: In this work, we propose an Augmented Visual Perception Learning (AVPL) method for Person Re-identification (ReID), inspired by the two-stream hypothesis of the Human Visual System (HVS). Deep learning methods dominate ReID, and many state-of-the-art results have been achieved by optimizing models of the 'what' visual pathway, but such models do not blend 'what' and 'where' well. Our AVPL method uses the essential mechanism of the ventro-dorsal stream of the 'where' visual pathway to expand the perception field of the model, and integrates it with the 'what' pathway to complete the information of the visually salient regions. A novel Batch Attention (BA), the key component of our Augmented Visual Perception (AVP) module, is proposed to apply fusion and augmentation to all input feature maps within each batch. Through the AVP module, the improved attention-based model attaches more importance to the enhancement of salient features, thereby acquiring a better perceptual ability for the salient regions that provide the most discriminative cues for ReID. Extensive experiments have been carried out on four mainstream ReID datasets and two recognition datasets. In terms of ReID, our method achieves rank-1 accuracy of 95.2% on CUHK03-NP, 98.7% on Market-1501, 96.0% on DukeMTMC-reID and 88.8% on MSMT17-V1, outperforming the state-of-the-art methods by a large margin. In addition, it has been experimentally shown to be applicable and effective in other recognition tasks, including facial expression recognition and action recognition, with improved accuracy. © 2022 Elsevier Ltd. All rights reserved.
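The rank-1 accuracy quoted in the abstract is the fraction of query images whose single closest gallery match shares the query's identity. The following is a minimal sketch of that metric only, not of the AVPL method itself. The function name `rank1_accuracy`, the use of cosine similarity, and the array layout are assumptions made for illustration; standard ReID protocols additionally exclude gallery images taken by the same camera as the query, which this simplified version omits.

```python
import numpy as np

def rank1_accuracy(query_feats, query_ids, gallery_feats, gallery_ids):
    """Fraction of queries whose nearest gallery item has the same identity.

    Hypothetical helper for illustration; features are (N, D) float arrays,
    ids are (N,) integer arrays. Camera-based filtering is NOT applied.
    """
    # L2-normalize so that the dot product equals cosine similarity.
    q = query_feats / np.linalg.norm(query_feats, axis=1, keepdims=True)
    g = gallery_feats / np.linalg.norm(gallery_feats, axis=1, keepdims=True)
    # Similarity of every query to every gallery item: shape (Nq, Ng).
    sim = q @ g.T
    # Index of the most similar gallery item for each query.
    top1 = np.argmax(sim, axis=1)
    # A query counts as a hit when its top match carries the same identity.
    return float(np.mean(gallery_ids[top1] == query_ids))
```

Under these assumptions, a reported rank-1 accuracy of 98.7% would mean that for 98.7% of queries the top-ranked gallery image is a correct match.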