Online Learning of Interaction Dynamics with Dual Model Predictive Control for Multi-Agent Systems Using Gaussian Processes

T.M.J.T. Baltussen, A. Katriniok, E. Lefeber, R. Tóth, W.P.M.H. Heemels
2024-08-31
Abstract: Decentralized control in complex and uncertain multi-agent scenarios requires careful consideration of the interactions between the agents. In this context, this paper proposes a dual model predictive control (MPC) method using Gaussian process (GP) models for multi-agent systems. While Gaussian process MPC (GP-MPC) has been shown to be effective in predicting the dynamics of other agents, current methods do not consider the influence of the control input on the covariance of the predictions, and, hence, lack the dual control effect. Therefore, we propose a dual MPC that directly optimizes the actions of the ego agent, and the belief of the other agents by jointly optimizing their state trajectories as well as the associated covariance while considering their interactions through a GP. We demonstrate our GP-MPC method in a simulation study on autonomous driving, showing improved prediction quality compared to a baseline stochastic MPC. Simulation results show that our GP-MPC can learn the interactions between agents online, demonstrating the potential of GPs for dual MPC in uncertain and unseen scenarios.
Optimization and Control
What problem does this paper attempt to address?
This paper addresses how to predict and handle the interactions between agents under decentralized control in complex and uncertain multi-agent systems. Specifically, it proposes a dual model predictive control (MPC) method that uses Gaussian process (GP) models to predict the dynamics of other agents while accounting for the influence of the control input on the prediction covariance, with the aim of a safer and more efficient control strategy.

### Main problems

1. **Uncertainty in prediction and control**:
   - Existing methods overlook the influence of the control input on the prediction covariance when predicting the dynamics of other agents, and therefore lack the dual control effect. The dual control effect means that the control input affects not only the state of the system but also the probability distribution associated with that state.
2. **Online learning and adaptability**:
   - In a multi-agent environment, the behavior of the other agents is often uncertain, especially in the absence of explicit communication. A method is therefore needed that can learn online and adapt to these uncertainties.

### Solutions

1. **Dual model predictive control (dual MPC)**:
   - A new dual MPC method is proposed. It directly optimizes the actions of the ego agent and the belief about the other agents by jointly optimizing the state trajectories and the associated covariance, while considering the interactions between agents.
2. **Gaussian process (GP) model**:
   - A GP model is used to predict the dynamics of other agents and to learn the interactions between agents through Bayesian inference. The model can be learned online and adapts to unknown or unseen behaviors.

### Experimental verification

- The paper demonstrates the proposed GP-MPC method in a lane-merging simulation study with self-driving vehicles, reporting better prediction quality and safety.
- The experimental results show that GP-MPC can predict the movement of other vehicles through online learning, without pre-training data, and can find an appropriate merging time.

### Contributions

1. **Extension of GP-MPC**:
   - GP-MPC is extended to local control in a multi-agent environment. Using joint probability and Bayesian inference, it accounts for the state covariance of the other agents and thereby achieves the dual control effect.
2. **Application cases**:
   - The proposed method is applied to a lane-merging scenario. By collecting training data online, GP-MPC adapts to the uncertain behavior of other agents, which shows its ability to generalize to unseen situations.

In conclusion, by combining dual model predictive control with a Gaussian process model, this paper addresses uncertainty in prediction and control in multi-agent systems and offers a new approach to safer and more efficient self-driving.
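
The online GP learning and the dual control effect summarized above can be illustrated with a minimal sketch. This is not the paper's formulation: it is plain GP regression in NumPy with a squared-exponential kernel, and every name in it is an assumption made for illustration (`OnlineGP`, `rbf_kernel`, the two-dimensional input of gap and ego speed, the hyperparameter values, and the sample observation). It shows only that the predictive variance at a query point depends on which data the ego agent has collected, which is the dependence a dual MPC would exploit.

```python
import numpy as np


def rbf_kernel(A, B, length_scale=1.0, signal_var=1.0):
    """Squared-exponential kernel between row-wise inputs A (n, d) and B (m, d)."""
    sq_dist = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return signal_var * np.exp(-0.5 * np.maximum(sq_dist, 0.0) / length_scale**2)


class OnlineGP:
    """Hypothetical GP regressor that grows its data set one sample at a time."""

    def __init__(self, input_dim, noise_var=1e-2):
        self.X = np.empty((0, input_dim))  # training inputs
        self.y = np.empty(0)               # training targets
        self.noise_var = noise_var         # assumed measurement-noise variance

    def add_observation(self, x, y):
        """Append one (input, target) pair collected online."""
        self.X = np.vstack([self.X, np.atleast_2d(x)])
        self.y = np.append(self.y, y)

    def predict(self, x_query):
        """Return the posterior mean and variance at the query point(s)."""
        Xq = np.atleast_2d(x_query)
        prior_var = np.diag(rbf_kernel(Xq, Xq))
        if self.X.shape[0] == 0:
            # No data yet: fall back to the zero-mean prior.
            return np.zeros(Xq.shape[0]), prior_var
        K = rbf_kernel(self.X, self.X) + self.noise_var * np.eye(len(self.y))
        K_s = rbf_kernel(self.X, Xq)
        L = np.linalg.cholesky(K)
        alpha = np.linalg.solve(L.T, np.linalg.solve(L, self.y))
        v = np.linalg.solve(L, K_s)
        mean = K_s.T @ alpha
        var = prior_var - np.sum(v**2, axis=0)
        return mean, var


# Assumed input: [gap to the other vehicle (m), ego speed (m/s)].
# Assumed target: the other vehicle's acceleration (m/s^2).
gp = OnlineGP(input_dim=2)
query = np.array([10.0, 5.0])

mean_before, var_before = gp.predict(query)       # prior belief, no data
gp.add_observation(np.array([10.5, 5.2]), -0.3)   # one made-up online sample
mean_after, var_after = gp.predict(query)         # belief after the update

print("variance before:", var_before[0])
print("variance after: ", var_after[0])
```

In this sketch the variance at the query point shrinks once a nearby sample is added. Because the ego agent's inputs determine which states are visited and hence which samples are collected, an optimizer that includes this variance in its cost or constraints would have an incentive to steer toward informative states.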