Parallel‐branch Network for 3D Human Pose and Shape Estimation in Video

Yuanhao Wu,Chenxing Wang
DOI: https://doi.org/10.1002/cav.2078
IF: 1.01
2022-01-01
Computer Animation and Virtual Worlds
Abstract:Human pose and shape estimation have developed rapidly, where a skinned multi‐person linear (SMPL) approach performs excellent recently. However, the prior template of the human body in the SMPL model is fixed, thus a deviation may be resulted in the reconstructed body shape if a human body acts sharp movements such as sporting or dancing. To address this problem, we propose a parallel‐branch network including a designed spatial–temporal (ST) branch and a SMPL branch. The ST branch essentially performs the 2D‐to‐3D lifting for more accurate joint prediction, by the designed spatial transformer and temporal transformer. The 3D joints from the ST branch are used to supervise the 3D joints from the SMPL branch and further correct the deviation of the SMPL model. Experiments on some popular benchmarks like 3DPW and MPI‐INF‐3DHP show that our method has better performance than other methods with video input. Our code is available at https://automation.seu.edu.cn/wcx/list.htm
What problem does this paper attempt to address?