Model Predictive Control of Nonlinear Dynamics Using Online Adaptive Koopman Operators

Daisuke Uchida,Karthik Duraisamy
2024-12-04
Abstract:This paper develops a methodology for adaptive data-driven Model Predictive Control (MPC) using Koopman operators. While MPC is ubiquitous in various fields of engineering, the controller performance can deteriorate if the modeling error between the control model and the true dynamics persists, which may often be the case with complex nonlinear dynamics. Adaptive MPC techniques learn models online such that the controller can compensate for the modeling error by incorporating newly available data. We utilize the Koopman operator framework to formulate an adaptive MPC technique that corrects for model discrepancies in a computationally efficient manner by virtue of convex optimization. With the use of neural networks to learn embedding spaces, Koopman operator models enable accurate dynamics modeling. Such complex model forms, however, often lead to unstable online learning. To this end, the proposed method utilizes the soft update of target networks, a technique used in stabilization of model learning in Reinforcement Learning (RL). Also, we provide a discussion on which parameters to be chosen as online updated parameters based on a specific description of linear embedding models. Numerical simulations on a canonical nonlinear dynamical system show that the proposed method performs favorably compared to other data-driven MPC methods while achieving superior computational efficiency through the utilization of Koopman operators.
Optimization and Control
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: **How to improve the model predictive control (MPC) performance of nonlinear systems through an adaptive data - driven method in the presence of modeling errors?** Specifically, the paper focuses on how to compensate for this error through online learning and adaptive adjustment when there is a deviation between the control model and the real - system dynamics. Traditional MPC methods may experience performance degradation in complex nonlinear systems due to modeling errors. To solve this problem, the authors propose an adaptive MPC method based on the Koopman operator, which can correct the model - mismatch problem through online learning with relatively high computational efficiency. ### Key Point Summary: 1. **Problem Background**: - Model predictive control (MPC) is widely used in various engineering applications, but its performance depends on the accuracy of the underlying control model. - For complex nonlinear systems, the control model usually has a deviation from the real - system dynamics, resulting in performance degradation. 2. **Research Objectives**: - Propose an adaptive MPC method based on the Koopman operator to address the challenges brought by modeling errors. - Improve control performance and maintain computational efficiency through online learning and adaptive adjustment. 3. **Solutions**: - Use the Koopman operator framework to transform nonlinear dynamics into a linear representation in the embedding space, thereby taking advantage of linear control theory. - Introduce the soft update technique (soft update of target networks), similar to the method in reinforcement learning, to stabilize the online learning process. - Selectively update model parameters, especially the feature mapping and matrix B, to improve the computational efficiency and robustness of the algorithm. 4. **Experimental Verification**: - Verify the effectiveness and superiority of the proposed method through numerical simulations (such as the inverted pendulum system), especially in dealing with modeling errors. - The results show that compared with other data - driven MPC methods, the proposed method performs well in terms of average error and execution time. ### Formula Explanation: - **State Update Formula**: \[ x_{k + 1}=f(x_k, u_k) \] where \( x_k\in X\subseteq\mathbb{R}^n \) and \( u_k\in U\subseteq\mathbb{R}^p \) represent the state and control input respectively, and \( f:\mathbb{R}^n\times\mathbb{R}^p\rightarrow\mathbb{R}^n \) is a possible nonlinear mapping. - **Cost Function**: \[ J:=\sum_{k = 0}^{H+1}\left((x_k - x_{\text{ref}, k})^T Q_{\text{state}}(x_k - x_{\text{ref}, k})+u_k^T R u_k\right) \] where \( H \) is the prediction horizon, \( x_{\text{ref}, k} \) is the reference signal, and \( Q_{\text{state}} \) and \( R \) are weight matrices. - **Linear Embedding Model**: \[ x_{\text{pred}, k+1}=C\{A g(x_k)+B u_k\} \] where \( g:X\rightarrow\mathbb{R}^N \) is the feature mapping, \( A\in\mathbb{R}^{N\times N} \), \( B\in\mathbb{R}^{N\times p} \), \( C\in\mathbb{R}^{n\times N} \) are matrices. Through these methods, the paper effectively solves the MPC modeling error problem in nonlinear systems and shows its potential in practical applications.