Abstract: Human pose estimation aims to locate the anatomical parts, or keypoints, of the human body and is regarded as a core component of detailed human understanding in images and videos. However, occlusion and overlap between human bodies, together with complex backgrounds, often result in implausible pose predictions. To address this problem, we propose a structure-aware adversarial framework that combines cues of local joint interconnectivity with priors about the holistic structure of human bodies, achieving high-quality results for multiperson pose estimation. Learning such cues and priors effectively is typically a challenge. The presented framework uses a nonparametric representation, referred to as the Keypoint Biorientation Field (KBOF), to learn orientation cues of joint interconnectivity in the image, much as human vision explores the geometric constraints of joint interconnectivity. Additionally, our framework applies a module that uses multiscale feature representation with inflated convolution for joint heatmap detection and KBOF detection, in order to fully explore the local features of joint points and the bidirectional connectivity between them at the microscopic level. Finally, we employ an improved generative adversarial network that uses KBOF and multiscale feature extraction and implicitly leverages the cues and priors about the structure of human bodies for global structural inference. The adversarial network enables our framework to combine information about the connections between local body joints at the microscopic level with the structural priors of the human body at the global level, thus enhancing the performance of our framework. The effectiveness and robustness of the network are evaluated on the task of human pose prediction on two widely used benchmark datasets, MPII and COCO. Our approach outperforms state-of-the-art methods, especially in complex scenes. Our method achieves improvements of 2.6% and 1.7% over the latest method on the MPII test set and the COCO validation set, respectively.

Adaptive Positive Sample Selection and Dynamic Soft Label Assignment for Keypoint Detection

SD-Pose: facilitating space-decoupled human pose estimation via adaptive pose perception guidance

Semi-supervised 2D Human Pose Estimation via Adaptive Keypoint Masking

Towards High Performance Human Keypoint Detection

Multi-person pose estimation using atrous convolution

Neural Interactive Keypoint Detection

Algorithm of Pedestrian Pose Recognition Based on Keypoint Detection

X-Pose: Detecting Any Keypoints

Self-supervised Siamese keypoint inference network for human pose estimation and tracking

KSL-POSE: A Real-Time 2D Human Pose Estimation Method Based on Modified YOLOv8-Pose Framework

SSpose: Self-supervised Spatial-aware Model for Human Pose Estimation

A Structure-Aware Adversarial Framework with the Keypoint Biorientation Field for Multiperson Pose Estimation

A Real-Time Head Pose Estimation Using Adaptive Posit Based On Modified Supervised Descent Method

Development of Multi-person Pose Estimation Method Based on PAFs

KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration

Parallel Self-Attention and Spatial-Attention Fusion for Human Pose Estimation and Running Movement Recognition

EP-Net: More Efficient Pose Estimation Network with the Classification-based Key-points Detection

Hierarchical Keypoints Feature Alignment for Domain Adaptive Pose Estimation

SAMKR: Bottom-up Keypoint Regression Pose Estimation Method Based on Subspace Attention Module

AdaptivePose++: A Powerful Single-Stage Network for Multi-Person Pose Regression

MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection