Abstract:Human motion prediction is the key technology for many real-life applications, e.g., self-driving and human–robot interaction. The recent approaches adopt the unrestricted full-connection graph representation to capture the relationships inside the human skeleton. However, there are two issues to be solved: (i) these unrestricted full-connection graph representation methods neglect the inherent dependencies across the joints of the human body; (ii) these methods represent human motions using the features extracted from a single level and thus can neither fully exploit the various connection relationships among the human body nor guarantee the human motion prediction results to be reasonable. To tackle the above issues, we propose an adaptive multi-level hypergraph convolution network (AMHGCN), which uses the adaptive multi-level hypergraph representation to capture various dependencies among the human body. Our method has four different levels of hypergraph representations, including (i) the joint-level hypergraph representation to capture inherent kinetic dependencies in the human body, (ii) the part-level hypergraph representation to exploit the kinetic characteristics at a higher level (in comparison to the joint-level) by viewing some part of the human body as an entirety, (iii) the component-level hypergraph representation to model the semantic information, and (iv) the global-level hypergraph representation to extract long-distance dependencies in the human body. In addition, to take full advantage of the knowledge carried in the training data, we propose a reverse loss ( i.e., adopting the future human poses to predict the historical poses reversely) to realize data augmentation. Extensive experiments show that our proposed AMHGCN can achieve state-of-the-art performance on three benchmarks, i.e., Human3.6M, CMU-Mocap, and 3DPW.

KD-Former: Kinematic and dynamic coupled transformer network for 3D human motion prediction

A Spatio-Temporal Transformer Network for Human Motion Prediction in Human-Robot Collaboration

Towards Realistic 3D Human Motion Prediction with A Spatio-temporal Cross-transformer Approach

KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation

Joint-Relation Transformer for Multi-Person Motion Prediction.

3D Human Motion Prediction Based on Graph Convolution Network and Transformer

Joint-Aware Transformer: An Inter-Joint Correlation Encoding Transformer for Short-Term 3D Human Motion Prediction

DMS-GCN: Dynamic Mutiscale Spatiotemporal Graph Convolutional Networks for Human Motion Prediction

KSOF: Leveraging kinematics and spatio-temporal optimal fusion for human motion prediction

TransFusion: A Practical and Effective Transformer-based Diffusion Model for 3D Human Motion Prediction

Multi-Person 3D Motion Prediction with Multi-Range Transformers

Human MotionFormer: Transferring Human Motions with Vision Transformers

Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction

AMHGCN: Adaptive multi-level hypergraph convolution network for human motion prediction

An Attractor-Guided Neural Networks for Skeleton-Based Human Motion Prediction

Velocity-to-velocity human motion forecasting

Towards more realistic human motion prediction with attention to motion coordination

TrajectoryCNN: A New Spatio-Temporal Feature Learning Network for Human Motion Prediction

EVOPOSE: A Recursive Transformer For 3D Human Pose Estimation With Kinematic Structure Priors

Robust Human Motion Forecasting using Transformer-based Model

3D Human Pose Estimation with Spatial and Temporal Transformers