Abstract:Human motion prediction is the key technology for many real-life applications, e.g., self-driving and human–robot interaction. The recent approaches adopt the unrestricted full-connection graph representation to capture the relationships inside the human skeleton. However, there are two issues to be solved: (i) these unrestricted full-connection graph representation methods neglect the inherent dependencies across the joints of the human body; (ii) these methods represent human motions using the features extracted from a single level and thus can neither fully exploit the various connection relationships among the human body nor guarantee the human motion prediction results to be reasonable. To tackle the above issues, we propose an adaptive multi-level hypergraph convolution network (AMHGCN), which uses the adaptive multi-level hypergraph representation to capture various dependencies among the human body. Our method has four different levels of hypergraph representations, including (i) the joint-level hypergraph representation to capture inherent kinetic dependencies in the human body, (ii) the part-level hypergraph representation to exploit the kinetic characteristics at a higher level (in comparison to the joint-level) by viewing some part of the human body as an entirety, (iii) the component-level hypergraph representation to model the semantic information, and (iv) the global-level hypergraph representation to extract long-distance dependencies in the human body. In addition, to take full advantage of the knowledge carried in the training data, we propose a reverse loss ( i.e., adopting the future human poses to predict the historical poses reversely) to realize data augmentation. Extensive experiments show that our proposed AMHGCN can achieve state-of-the-art performance on three benchmarks, i.e., Human3.6M, CMU-Mocap, and 3DPW.

A three-dimensional human motion pose recognition algorithm based on graph convolutional networks

3D Human Motion Prediction Based on Graph Convolution Network and Transformer

MFGCN: an efficient graph convolutional network based on multi-order feature information for human skeleton action recognition

Modelling Human Body Pose for Action Recognition Using Deep Neural Networks

Multi-hop graph transformer network for 3D human pose estimation

Simplified-attention Enhanced Graph Convolutional Network for 3D human pose estimation

Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos

Optimizing Network Structure for 3D Human Pose Estimation.

Action recognition algorithm based on skeleton graph with multiple features and improved adjacency matrix

AMHGCN: Adaptive multi-level hypergraph convolution network for human motion prediction

Symbiotic Graph Neural Networks for 3D Skeleton-based Human Action Recognition and Motion Prediction

A Two-stream Hybrid CNN-Transformer Network for Skeleton-based Human Interaction Recognition

Improved Graph Convolutional Neural Network for Dance Tracking and Pose Estimation

Joint graph convolution networks and transformer for human pose estimation in sports technique analysis

High Efficient LSTM-based Network for Human Interaction Understanding

A Lightweight Attentional Shift Graph Convolutional Network for Skeleton-Based Action Recognition

Conditional Directed Graph Convolution for 3D Human Pose Estimation

Learning to Recognize 3D Human Action from A New Skeleton-based Representation Using Deep Convolutional Neural Networks

Flexible graph convolutional network for 3D human pose estimation

Action Recognition Based on Joint Trajectory Maps with Convolutional Neural Networks

3D-UGCN: A Unified Graph Convolutional Network for Robust 3D Human Pose Estimation from Monocular RGB Images