Abstract:Global feature based Pedestrian Attribute Recognition (PAR) models are often poorly localized when using Grad-CAM for attribute response analysis, which has a significant impact on the interpretability, generalizability and performance. Previous researches have attempted to improve generalization and interpretation through meticulous model design, yet they often have neglected or underutilized effective prior information crucial for PAR. To this end, a novel Scale and Spatial Priors Guided Network (SSPNet) is proposed for PAR, which is mainly composed of the Adaptive Feature Scale Selection (AFSS) and Prior Location Extraction (PLE) modules. The AFSS module learns to provide reasonable scale prior information for different attribute groups, allowing the model to focus on different levels of feature maps with varying semantic granularity. The PLE module reveals potential attribute spatial prior information, which avoids unnecessary attention on irrelevant areas and lowers the risk of model over-fitting. More specifically, the scale prior in AFSS is adaptively learned from different layers of feature pyramid with maximum accuracy, while the spatial priors in PLE can be revealed from part feature with different granularity (such as image blocks, human pose keypoint and sparse sampling points). Besides, a novel IoU based attribute localization metric is proposed for Weakly-supervised Pedestrian Attribute Localization (WPAL) based on the improved Grad-CAM for attribute response mask. The experimental results on the intra-dataset and cross-dataset evaluations demonstrate the effectiveness of our proposed method in terms of mean accuracy (mA). Furthermore, it also achieves superior performance on the PCS dataset for attribute localization in terms of IoU. Code will be released at <a class="link-external link-https" href="https://github.com/guotengg/SSPNet" rel="external noopener nofollow">this https URL</a>.

An efficient pedestrian attributes recognition system under challenging conditions

Dual-branch Self-Attention Network for Pedestrian Attribute Recognition

A novel self-boosting dual-branch model for pedestrian attribute recognition

Pedestrian Attribute Recognition Via Spatio-temporal Relationship Learning for Visual Surveillance

Deep Template Matching for Pedestrian Attribute Recognition with the Auxiliary Supervision of Attribute-wise Keypoints

Pedestrian attribute recognition: A survey

Exponential Information Bottleneck Theory Against Intra-Attribute Variations for Pedestrian Attribute Recognition

SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks

Recurrent Attention Model for Pedestrian Attribute Recognition.

Pedestrian Attribute Recognition: A New Benchmark Dataset and A Large Language Model Augmented Framework

Generate and adjust: a novel framework for semi-supervised pedestrian attribute recognition

Pedestrian Attribute Recognition via CLIP based Prompt Vision-Language Fusion

A Richly Annotated Dataset for Pedestrian Attribute Recognition

SSPNet: Scale and Spatial Priors Guided Generalizable and Interpretable Pedestrian Attribute Recognition

Incremental Few-Shot Learning for Pedestrian Attribute Recognition

Orientation-Aware Pedestrian Attribute Recognition based on Graph Convolution Network

Enhancing Person Re-Identification through Attention-Driven Global Features and Angular Loss Optimization

UPAR: Unified Pedestrian Attribute Recognition and Person Retrieval

Pedestrian Recognition in Multi-Camera Networks Using Multilevel Important Salient Feature and Multicategory Incremental Learning.

Attribute-Aware Pedestrian Detection in a Crowd

SequencePAR: Understanding Pedestrian Attributes via A Sequence Generation Paradigm