Abstract:<p>Human pose estimation is the task of localizing body key points from still images. As body key points are inter-connected, it is desirable to model the structural relationships between body key points to further improve the localization performance. In this paper, based on original graph convolutional networks, we propose a novel model, termed Pose Graph Convolutional Network (PGCN), to exploit these important relationships for pose estimation. Specifically, our model builds a directed graph between body key points according to the natural compositional model of a human body. Each node (key point) is represented by a 3-D tensor consisting of multiple feature maps, initially generated by our backbone network, to retain accurate spatial information. Furthermore, attention mechanism is presented to focus on crucial edges (structured information) between key points. PGCN is then learned to map the graph into a set of structure-aware key point representations which encode both structure of human body and appearance information of specific key points. Additionally, we propose two modules for PGCN, i.e., the Local PGCN (L-PGCN) module and Non-Local PGCN (NL-PGCN) module. The former utilizes spatial attention to capture the correlations between the local areas of adjacent key points to refine the location of key points. While the latter captures long-range relationships via non-local operation to associate the challenging key points. By equipping with these two modules, our PGCN can further improve localization performance. Experiments both on single- and multi-person estimation benchmark datasets show that our method consistently outperforms competing state-of-the-art methods.</p>

Learning Recurrent Structure-Guided Attention Network for Multi-person Pose Estimation.

Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution.

Multi-Scale Structure-Aware Network for Human Pose Estimation

Research on Multi-level Attention-based Human Pose Estimation

Structure-aware human pose estimation with graph convolutional networks

Joint Multi-Person Pose Estimation and Semantic Part Segmentation

Simplified-attention Enhanced Graph Convolutional Network for 3D human pose estimation

Rethinking on Multi-Stage Networks for Human Pose Estimation

A Context-and-Spatial Aware Network for Multi-Person Pose Estimation

Multi-Scale Supervised Network for Human Pose Estimation

Improving Multiperson Pose Estimation by Mask-aware Deep Reinforcement Learning

Multi-Person Pose Estimation with Accurate Heatmap Regression and Greedy Association

Pose Guided Structured Region Ensemble Network for Cascaded Hand Pose Estimation

3D Human Pose Estimation Via Human Structure-Aware Fully Connected Network

Human Pose Estimation Using Deep Structure Guided Learning.

Human Pose Estimation Based on Lightweight Multi-Scale Coordinate Attention

Interweaved Graph and Attention Network for 3D Human Pose Estimation

RSGNet: Relation Based Skeleton Graph Network for Crowded Scenes Pose Estimation

Multi-Context Attention for Human Pose Estimation.

Multi-person 3D pose estimation from unlabelled data

Multi-Person Pose Estimation with Enhanced Channel-wise and Spatial Information