Abstract: In recent years, attention mechanisms have attracted enormous interest in Person Re-Identification (RE-ID) because of their superior performance in obtaining discriminative feature representations. However, many state-of-the-art RE-ID attention models focus on only a one-dimensional attention design, e.g. spatial attention or channel attention, so the attention maps they produce are neither detailed nor discriminative enough to capture complicated interactions among visual parts. Developing a multi-scale attention mechanism for RE-ID, an under-studied approach, is a practicable way to overcome this deficiency. Toward this goal, we propose Multiscale Omnibearing Attention Networks (MOAN) for RE-ID, which exploit the complex fused information acquired from the multiscale attention mechanism to make features more representative. Specifically, MOAN uses multi-sized convolution filters to obtain discriminative holistic and local feature maps, and adaptively augments feature information by introducing an Omnibearing Attention (OA) module. In the OA module, spatial attention and channel attention are integrated so that they complement each other. In summary, MOAN not only inherits the merits of both kinds of attention mechanism but also performs well in extracting comprehensive feature information. Furthermore, to improve the robustness of the model, we formulate a Random Drop (RD) function to facilitate training MOAN and further increase the diversity of the trained model for adaptation. In addition, to achieve end-to-end training, we replace the initially fixed parameters with trainable parameters, which experimentally improves model performance. Extensive experiments have been carried out on four mainstream RE-ID datasets. As the results show, our method with re-ranking achieves rank-1 accuracy of 92.29% on CUHK03-NP, 97.45% on Market-1501, 93.81% on DukeMTMC-reID and 81.53% on MSMT17-V2, outperforming state-of-the-art methods and confirming the effectiveness of our method.
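The abstract does not say how the OA module combines its two attention types, so the following is only a generic sketch of the idea of reweighting a feature map with both channel and spatial attention. It is not the MOAN implementation. The function names, the use of plain average pooling with a sigmoid, and the mixing weight `alpha` (a stand-in for a trainable parameter) are all assumptions made for illustration.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_attention(feat):
    """One weight per channel, from global average pooling (no learned layers)."""
    weights = []
    for ch in feat:
        n = sum(len(row) for row in ch)
        mean = sum(sum(row) for row in ch) / n
        weights.append(sigmoid(mean))
    return weights

def spatial_attention(feat):
    """One weight per spatial position, from the mean across channels."""
    c, h, w = len(feat), len(feat[0]), len(feat[0][0])
    return [[sigmoid(sum(feat[k][i][j] for k in range(c)) / c)
             for j in range(w)] for i in range(h)]

def combined_attention(feat, alpha=0.5):
    """Reweight feat (channels x height x width, nested lists) by a blend
    of channel and spatial attention. alpha is a hypothetical mixing weight;
    in a trained network it would be learned rather than fixed."""
    cw = channel_attention(feat)
    sw = spatial_attention(feat)
    c, h, w = len(feat), len(feat[0]), len(feat[0][0])
    return [[[feat[k][i][j] * (alpha * cw[k] + (1 - alpha) * sw[i][j])
              for j in range(w)] for i in range(h)] for k in range(c)]
```

Calling `combined_attention(feat)` on a nested list of shape channels x height x width returns a reweighted map of the same shape. A multiscale variant would apply this to feature maps produced by convolution filters of several sizes, which this sketch leaves out.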

IMG-Net: Inner-Cross-modal Attentional Multigranular Network for Description-Based Person Re-Identification.

Multi-granularity Cross Attention Network for Person Re-Identification

A Cross-Modal Multi-granularity Attention Network for RGB-IR Person Re-identification

Dual Attention Matching Network for Context-Aware Feature Sequence based Person Re-Identification

A Local-Global Self-attention Interaction Network for RGB-D Cross-Modal Person Re-identification.

Multiple Biological Granularities Network for Person Re-Identification

Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments

Multiscale Omnibearing Attention Networks for Person Re-Identification

Multiscale Global-Aware Channel Attention for Person Re-identification

Concentrated Multi-Grained Multi-Attention Network for Video Based Person Re-Identification

LOCAL TO GLOBAL WITH MULTI-SCALE ATTENTION NETWORK FOR PERSON RE-IDENTIFICATION

Cross-Modality Person Re-Identification Method with Joint-Modality Generation and Feature Enhancement

VMRFANet: View-Specific Multi-Receptive Field Attention Network for Person Re-identification

Information complementary attention-based multidimension feature learning for person re-identification

Integration Graph Attention Network and Multi‐centre Constrained Loss for Cross‐modality Person Re‐identification

A part-based attention network for person re-identification

Multi-layer Attention for Person Re-Identification

A Dual‐modal Graph Attention Interaction Network for Person Re‐identification

Inter-Modality Similarity Learning for Unsupervised Multi-Modality Person Re-Identification

Dual Branch Attention Network for Person Re-Identification

Complementation-Reinforced Attention Network for Person Re-Identification