HPGCN: Hierarchical Poselet-Guided Graph Convolutional Network for 3D Pose Estimation

Yongpeng Wu, Dehui Kong, Shaofan Wang, Jinghua Li, Baocai Yin
DOI: https://doi.org/10.1016/j.neucom.2021.11.007
IF: 6
2021-01-01
Neurocomputing
Abstract: 3D pose estimation remains a challenging task because human poses exhibit high ambiguity and multi-granularity. Traditional graph convolutional networks (GCNs) approach the task by modeling the whole skeleton as a single graph, and so cannot fuse combinable part-based features. Observing that human movements arise from parts of the human body (i.e. related skeletons and body components, known as poselets) and that these poselets contribute to each movement in a hierarchical fashion, we propose a hierarchical poselet-guided graph convolutional network (HPGCN) for 3D pose estimation from 2D poses. HPGCN takes five primitives of the human body as basic poselets and composes high-level poselets according to the kinematic configuration of the human body. Moreover, HPGCN builds its fundamental unit from a diagonally dominant graph convolution layer and a non-local layer, which cooperatively capture the multi-granular features of human poses from local to global perspectives. Finally, HPGCN uses a geometric constraint loss function with constraints on the lengths and directions of bone vectors, which helps produce reasonable pose regression. We verify the effectiveness of HPGCN on three public 3D human pose benchmarks. Experimental results show that HPGCN outperforms several state-of-the-art methods.
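The abstract does not give the exact form of the geometric constraint loss, so the following is only a minimal sketch of one plausible reading: penalize the difference in bone lengths and the angular deviation of bone directions between a predicted and a reference 3D pose. The 5-joint chain in `BONES`, the function names, the weights `w_len` and `w_dir`, and the choice of an absolute length difference plus a `1 - cosine` direction term are all assumptions made for illustration, not the authors' formulation.

```python
import math

# Hypothetical 5-joint chain; each pair is (parent_index, child_index).
# The paper's actual skeleton layout is not given in the abstract.
BONES = [(0, 1), (1, 2), (2, 3), (3, 4)]


def bone_vectors(pose, bones=BONES):
    """Return one child-minus-parent 3D vector per bone."""
    return [tuple(c - p for p, c in zip(pose[i], pose[j])) for i, j in bones]


def geometric_loss(pred, target, bones=BONES, w_len=1.0, w_dir=1.0, eps=1e-8):
    """Assumed loss: mean bone-length error plus mean (1 - cosine) direction error."""
    length_term = 0.0
    direction_term = 0.0
    for bp, bt in zip(bone_vectors(pred, bones), bone_vectors(target, bones)):
        len_p = math.sqrt(sum(x * x for x in bp))
        len_t = math.sqrt(sum(x * x for x in bt))
        # Length constraint: absolute difference of bone lengths.
        length_term += abs(len_p - len_t)
        # Direction constraint: 1 - cosine similarity of the two bone vectors.
        cos = sum(a * b for a, b in zip(bp, bt)) / (len_p * len_t + eps)
        direction_term += 1.0 - cos
    return (w_len * length_term + w_dir * direction_term) / len(bones)


# Reference pose: joints stacked along z with unit-length bones.
target = [(0.0, 0.0, float(k)) for k in range(5)]
# Same directions, doubled bone lengths: only the length term contributes.
stretched = [(0.0, 0.0, 2.0 * k) for k in range(5)]
# Same bone lengths, rotated onto the x axis: only the direction term contributes.
rotated = [(float(k), 0.0, 0.0) for k in range(5)]

loss_same = geometric_loss(target, target)
loss_stretched = geometric_loss(stretched, target)
loss_rotated = geometric_loss(rotated, target)
```

Keeping the two terms separate means a prediction with correct limb proportions but wrong orientation is penalized differently from one with correct orientation but implausible limb lengths, which is the kind of distinction a bone-vector constraint is meant to capture.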