Abstract: Person search is a time-consuming computer vision task that entails locating and recognizing query people in scene images. During matching, body parts are often mismatched because of position variation, occlusion, and partially missing body parts, which degrades search results. Existing approaches that use keypoint information to extract local features of the human body cannot handle the search task when distinct body parts are misaligned, and they fail to exploit multiple granularities, which are crucial in person search. Moreover, alignment learning methods learn body part features with fixed, equal weights and ignore beneficial contextual information, e.g., an umbrella carried by the pedestrian, which provides compelling clues for identifying the person. In this paper, we propose a Coarse-to-Fine Adaptive Alignment Representation (CFA²R) network that learns multi-granular features for misaligned person search from a coarse-to-fine perspective. To exploit more beneficial body parts and the related context of the cropped pedestrians, we design a Part-Attentional Progressive Module (PAPM) that guides the network to focus on informative body parts and beneficial accessory regions. In addition, we propose a Re-weighting Alignment Module (RAM) that emphasizes the more contributive parts instead of treating all parts equally. Specifically, a Re-weighting Reconstruction module reconstructs part features with adaptive rather than fixed weights, since different parts contribute unequally during image matching. Extensive experiments on the CUHK-SYSU and PRW datasets demonstrate the competitive performance of the proposed method.
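To illustrate why adaptive part weights can help under misalignment, the following is a minimal sketch and not the RAM or Re-weighting Reconstruction module itself. It compares equal-weight and re-weighted aggregation of per-part cosine similarities. The function names, the toy part features, and the `part_scores` values are all hypothetical; in a trained network such scores would presumably be predicted from the features rather than set by hand.

```python
import math

def cosine(u, v):
    """Cosine similarity between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def softmax(scores):
    """Numerically stable softmax that turns raw scores into weights summing to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def part_similarity(query_parts, gallery_parts, part_scores=None):
    """Aggregate per-part similarities.

    part_scores=None uses fixed, equal weights (the baseline behaviour);
    otherwise the hypothetical per-part scores are softmax-normalized
    and used as adaptive weights.
    """
    sims = [cosine(q, g) for q, g in zip(query_parts, gallery_parts)]
    if part_scores is None:
        weights = [1.0 / len(sims)] * len(sims)
    else:
        weights = softmax(part_scores)
    return sum(w * s for w, s in zip(weights, sims))

# Toy example: head and torso match, but the legs are occluded in the gallery crop.
query = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]     # head, torso, legs
gallery = [[1.0, 0.0], [0.0, 1.0], [1.0, -1.0]]  # legs feature is unreliable

equal_sim = part_similarity(query, gallery)
# Assumed scores: the unreliable legs part receives a low score.
weighted_sim = part_similarity(query, gallery, part_scores=[2.0, 2.0, -2.0])
print(equal_sim, weighted_sim)
```

With equal weights the occluded part drags the match score down, whereas down-weighting it keeps the score close to that of the reliable parts.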

Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search
