Prior-knowledge-based Self-Attention Network for 3D Human Pose Estimation

Shu Chen,Yaxin Xu,Beiji Zou
DOI: https://doi.org/10.1016/j.eswa.2023.120213
IF: 8.5
2023-01-01
Expert Systems with Applications
Abstract:Estimating three-dimensional (3D) human poses from two-dimensional (2D) joints has achieved promising results. However, there is relatively little work focused on exploiting domain-specific knowledge as prior. In this work, we present a learning framework based on prior knowledge for the task of estimating a 3D human pose from a 2D pose. In contrast to other state-of-the-art 3D pose estimation approaches, the proposed method is a systematic analysis pipeline that takes full advantage of prior knowledge based on three observations. The proposed approach can model the spatial and temporal relations between joints to achieve better performance. Our approach formulates the learning network as an encoder–decoder architecture that explicitly encodes prior knowledge about the task. The encoder is a multi-head self-attention network which can capture human joint spatial relations. The decoder is formulated as three separated sub-networks, each sub-network represents a kinematic chain which is derived from our prior knowledge about human motion. The experimental results on the Human3.6M, HumanEva and MPI-INF-3DHP datasets demonstrate the effectiveness of our approach. The code and data are available at https://github.com/XTU-PR-LAB/PK-SAN.
What problem does this paper attempt to address?