Learning-Based Design of Off-Policy Gaussian Controllers: Integrating Model Predictive Control and Gaussian Process Regression

Shiva Kumar Tekumatla,Varun Gampa,Siavash Farzan
2024-03-16
Abstract:This paper presents an off-policy Gaussian Predictive Control (GPC) framework aimed at solving optimal control problems with a smaller computational footprint, thereby facilitating real-time applicability while ensuring critical safety considerations. The proposed controller imitates classical control methodologies by modeling the optimization process through a Gaussian process and employs Gaussian Process Regression to learn from the Model Predictive Control (MPC) algorithm. Notably, the Gaussian Process setup does not incorporate a built-in model, enhancing its applicability to a broad range of control problems. We applied this framework experimentally to a differential drive mobile robot, tasking it with trajectory tracking and obstacle avoidance. Leveraging the off-policy aspect, the controller demonstrated adaptability to diverse trajectories and obstacle behaviors. Simulation experiments confirmed the effectiveness of the proposed GPC method, emphasizing its ability to learn the dynamics of optimal control strategies. Consequently, our findings highlight the significant potential of off-policy Gaussian Predictive Control in achieving real-time optimal control for handling of robotic systems in safety-critical scenarios.
Robotics,Systems and Control
What problem does this paper attempt to address?
The paper aims to address optimization problems in real-time robot control and ensure safety in critical application scenarios. Specifically, the paper proposes an off-policy predictive control (GPC) framework based on Gaussian processes to reduce the computational load required by traditional model predictive control (MPC) methods, thereby enabling real-time applications. This controller learns from the MPC algorithm through Gaussian process regression (GPR) and does not require an embedded system dynamics model, making it suitable for various control problems. The main contribution of the research is the proposal of a general off-policy learning algorithm that combines optimal control methods and learning capabilities to understand the behavior of MPC. Compared to traditional MPC methods, this framework leverages existing data for inference without the need for iterative solving of the optimal control problem. Experimental results show that the proposed off-policy Gaussian predictive control can achieve or even surpass the performance of MPC in trajectory tracking and obstacle avoidance, highlighting the potential of this method to achieve real-time optimal control in complex robotic systems.