Abstract: Occluded person re-identification (Re-ID) is a challenging task, as pedestrians are often obstructed by various occlusions, such as non-pedestrian objects or non-target pedestrians. Previous methods have relied heavily on auxiliary models, such as human pose estimators, to obtain information from unoccluded regions. However, these auxiliary models fall short in accounting for pedestrian occlusions, leading to potential misrepresentations. In addition, some previous works learned feature representations from single images, ignoring the potential relations among samples. To address these issues, this paper introduces a Multi-Level Relation-Aware Transformer (MLRAT) model for occluded person Re-ID. The model comprises two novel modules: Patch-Level Relation-Aware (PLRA) and Sample-Level Relation-Aware (SLRA). PLRA learns fine-grained local features by modeling the structural relations between key patches, removing the dependency on auxiliary models. It adopts a model-free method to select key patches that have high semantic correlation with the final pedestrian representation. In particular, to alleviate the interference of occlusion, PLRA captures the structural relations among key patches via a two-layer Graph Convolution Network (GCN), which guides local feature fusion and learning. SLRA is designed to help the model learn discriminative features by modeling the relations among samples. Specifically, to mitigate noisy relations from irrelevant samples, we present a Relation-Aware Transformer (RAT) block that captures the relations among neighbors. Furthermore, to bridge the gap between the training and testing phases, a self-distillation method is employed to transfer the sample-level relations captured by SLRA to the backbone. Extensive experiments are conducted on four occluded datasets, two partial datasets, and two holistic datasets. The results show that the proposed MLRAT model significantly outperforms existing baselines on the four occluded datasets, while maintaining top performance on the two partial datasets and the two holistic datasets.
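
The patch-level idea described in the abstract (select key patches that correlate with the pedestrian representation, then relate them with a two-layer GCN) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not state the selection score, the graph construction, or the fusion step, so the cosine-similarity score against a class token, the cosine-affinity adjacency, the mean-pool fusion, and all names (`select_key_patches`, `normalized_adjacency`, `two_layer_gcn`) and sizes below are assumptions made for illustration, with random weights standing in for learned ones.

```python
import numpy as np

def select_key_patches(cls_token, patches, k):
    """Return indices of the k patches most cosine-similar to cls_token.

    Assumption: cosine similarity stands in for the 'semantic correlation'
    mentioned in the abstract, which does not define the score.
    """
    cls_unit = cls_token / np.linalg.norm(cls_token)
    patch_units = patches / np.linalg.norm(patches, axis=1, keepdims=True)
    scores = patch_units @ cls_unit
    return np.argsort(-scores)[:k]

def normalized_adjacency(feats):
    """Symmetric-normalized adjacency built from non-negative cosine affinities.

    Assumption: the graph construction is hypothetical. The diagonal of the
    affinity matrix is 1, so it plays the role of self-loops.
    """
    units = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    adj = np.maximum(units @ units.T, 0.0)
    deg_inv_sqrt = 1.0 / np.sqrt(adj.sum(axis=1))
    return adj * deg_inv_sqrt[:, None] * deg_inv_sqrt[None, :]

def two_layer_gcn(feats, adj_norm, w1, w2):
    """Two graph-convolution layers with a ReLU in between."""
    hidden = np.maximum(adj_norm @ feats @ w1, 0.0)
    return adj_norm @ hidden @ w2

# Toy inputs: random features stand in for transformer outputs.
rng = np.random.default_rng(0)
dim, num_patches, k = 8, 16, 4
cls_token = rng.normal(size=dim)
patches = rng.normal(size=(num_patches, dim))

idx = select_key_patches(cls_token, patches, k)
key_feats = patches[idx]
adj_norm = normalized_adjacency(key_feats)

# Random weights stand in for learned GCN parameters.
w1 = rng.normal(size=(dim, dim)) * 0.1
w2 = rng.normal(size=(dim, dim)) * 0.1
refined = two_layer_gcn(key_feats, adj_norm, w1, w2)

# Assumed fusion step: mean-pool the refined key patches into one local feature.
local_feature = refined.mean(axis=0)
```

In this sketch, patches that score low against the class token never enter the graph, which is one plausible way occluded regions could be kept out of the local feature without a pose estimator.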

Self-Guided Body Part Alignment with Relation Transformers for Occluded Person Re-Identification

Person Re-identification Based on Transform Algorithm

Joining Features by Global Guidance with Bi-Relevance Trihard Loss for Person Re-Identification

Learning transformer-based attention region with multiple scales for occluded person re-identification

Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification

Point-level feature learning based on vision transformer for occluded person re-identification

MP2PMatch: A Mask-guided Part-to-Part Matching network based on transformer for occluded person re-identification

Pose-guided Feature Disentangling for Occluded Person Re-identification Based on Transformer

Occlusion-Aware Transformer With Second-Order Attention for Person Re-Identification

Part-aware Network: a Simple but Efficient Method for Occluded Person Re-Identification

A Multi-Level Relation-Aware Transformer model for occluded person re-identification

Feature Completion Transformer for Occluded Person Re-identification

Learning Disentangled Representation Implicitly via Transformer for Occluded Person Re-Identification

Skip Connection Aggregation Transformer for Occluded Person Reidentification

A Semantic Perception and CNN-Transformer Hybrid Network for Occluded Person Re-identification

Body Part-Level Domain Alignment for Domain-Adaptive Person Re-Identification with Transformer Framework

Exploring Stronger Transformer Representation Learning for Occluded Person Re-Identification

Occluded Person Re-Identification Method Based on Multiscale Features and Human Feature Reconstruction

Diverse Part Discovery: Occluded Person Re-identification with Part-Aware Transformer