Abstract: Multi-person motion prediction is an emerging and intricate task with broad real-world applications. Unlike single-person motion prediction, it must consider not only skeleton structures and human trajectories but also the interactions among people. Previous methods use various networks to achieve impressive predictions but often overlook that joint relations within an individual (intra-relations) and interactions among groups (inter-relations) are distinct types of representations. These methods often lack explicit representations of inter- and intra-relations and inevitably introduce undesired dependencies. To address this issue, we introduce a new collaborative framework for multi-person motion prediction that explicitly models these relations: a GCN-based network for intra-relations and a novel reasoning network for inter-relations. Moreover, we propose a novel plug-and-play aggregation module called the Interaction Aggregation Module (IAM), which employs an aggregate-attention mechanism to seamlessly integrate these relations. Experiments indicate that the module can also be applied to other dual-path models. Extensive experiments on the 3DPW, 3DPW-RC, CMU-Mocap, and MuPoTS-3D datasets, as well as the synthesized datasets Mix1 and Mix2 (9 to 15 persons), demonstrate that our method achieves state-of-the-art performance.
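The distinction between intra-relations and inter-relations can be pictured as two complementary masks over all joint pairs in a scene. The following is a minimal sketch under stated assumptions, not the paper's implementation: the function name `relation_masks`, the flattened joint indexing, and the example sizes are hypothetical and chosen only for illustration.

```python
def relation_masks(num_persons, joints_per_person):
    """Return (intra, inter) 0/1 masks over all pairs of flattened joints.

    Assumption: joints are flattened person by person, so joint index
    i belongs to person i // joints_per_person.
    """
    n = num_persons * joints_per_person
    # Person that owns each flattened joint index.
    owner = [idx // joints_per_person for idx in range(n)]
    # Intra-relation: both joints belong to the same person.
    intra = [[1 if owner[i] == owner[j] else 0 for j in range(n)]
             for i in range(n)]
    # Inter-relation: the joints belong to different persons.
    inter = [[1 - intra[i][j] for j in range(n)] for i in range(n)]
    return intra, inter


# Hypothetical toy scene: 2 persons with 3 joints each.
intra, inter = relation_masks(num_persons=2, joints_per_person=3)
for row in intra:
    print(row)
```

A model that treats all joints as one graph effectively uses the union of both masks. Keeping them separate lets each path attend only to the kind of pair it is meant to represent.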

What problem does this paper attempt to address?

This paper addresses the modeling of complex interactions in multi-person motion prediction. Specifically, the authors point out two deficiencies of existing methods:

1. **Ignoring different types of relations**: Existing methods usually treat all skeletal joints as a whole when establishing relationships, ignoring that the intra-relations among joints within an individual and the inter-relations among joints of different individuals are different forms of representation.
2. **Introducing unnecessary dependencies**: Because these two kinds of relations are not clearly distinguished, existing methods may introduce unwanted dependencies, which weakens the learning of interaction information and simplifies joint constraints.

To solve these problems, the authors propose a new collaborative framework that explicitly models both kinds of relations:

- A GCN-based network models the intra-relations among joints within an individual.
- A new reasoning network models the inter-relations between individuals.

In addition, the authors propose a plug-and-play aggregation module, the Interaction Aggregation Module (IAM), which uses an aggregate-attention mechanism to integrate these relations seamlessly. Experimental results show that this module can also be applied to other dual-path models and improves their prediction performance.

### Specific contribution summary:

1. **Proposing a new collaborative learning framework**: The framework explicitly models the intra-relations among joints within an individual and the inter-relations between individuals, enhancing the robustness and interpretability of the model.
2. **Introducing the Interaction Aggregation Module (IAM)**: This module integrates intra-relations and inter-relations through the aggregate-attention mechanism and is plug-and-play compatible with other models.
3. **Achieving state-of-the-art performance on multiple datasets**: These include 3DPW, 3DPW-RC, CMU-Mocap, MuPoTS-3D, and the synthetic datasets Mix1 and Mix2.

Through these improvements, the authors aim to capture the complex interactions among multiple people more accurately, thereby improving multi-person motion prediction.
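The text does not specify how IAM computes its attention, so the following is only a generic sketch of attention-weighted fusion of two feature paths, not the authors' module. The names `fuse` and `query`, the scaled dot-product scoring, and the toy vectors are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def fuse(query, intra_feat, inter_feat):
    """Blend two path features with softmax weights.

    Assumption: each path is scored by a scaled dot product with a
    query vector; a real module would learn its own projections.
    """
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, intra_feat) / scale,
                       dot(query, inter_feat) / scale])
    fused = [weights[0] * a + weights[1] * b
             for a, b in zip(intra_feat, inter_feat)]
    return fused, weights


# Hypothetical 2-dimensional features for a single joint.
fused, weights = fuse(query=[1.0, 0.0],
                      intra_feat=[2.0, 0.0],
                      inter_feat=[0.0, 2.0])
print(fused, weights)
```

Because the fusion only needs two feature vectors of equal size, a module of this shape could in principle sit on top of any dual-path model, which is the sense in which such an aggregator is plug-and-play.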

Relation Learning and Aggregate-attention for Multi-person Motion Prediction
