KD-Former: Kinematic and dynamic coupled transformer network for 3D human motion prediction

Ju Dai,Hao Li,Rui Zeng,Junxuan Bai,Feng Zhou,Junjun Pan
DOI: https://doi.org/10.1016/j.patcog.2023.109806
IF: 8
2023-07-10
Pattern Recognition
Abstract:Recent studies have made remarkable progress on 3D human motion prediction by describing motion with kinematic knowledge. However, kinematics only considers the 3D positions or rotations of human skeletons, failing to reveal the physical characteristics of human motion. Motion dynamics reflects the forces between joints, explicitly encoding the skeleton topology, whereas rarely exploited in motion prediction. In this paper, we propose the K inematic and D ynamic coupled trans Former (KD-Former), which incorporates dynamics with kinematics, to learn powerful features for high-fidelity motion prediction. Specifically, We first formulate a reduced-order dynamic model of human body to calculate the forces of all joints. Then we construct a non-autoregressive encoder-decoder framework based on the transformer structure. The encoder involves a kinematic encoder and a dynamic encoder, which are respectively responsible for extracting the kinematic and dynamic features for given history sequences via a spatial transformer and a temporal transformer. Future query sequences are decoded in parallel in the decoder by leveraging the encoded kinematic and dynamic information of history sequences. Experiments on Human3.6M and CMU MoCap benchmarks verify the effectiveness and superiority of our method. Code will be available at: https://github.com/wslh852/KD-Former.git .
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?