Fletcher Fan, Bowen Yi, David Rye, Guodong Shi, Ian R. Manchester
Abstract: This paper introduces new model parameterizations for learning dynamical systems from data via the Koopman operator, and studies their properties. Whereas most existing works on Koopman learning do not take into account the stability or stabilizability of the model -- two fundamental pieces of prior knowledge about a given system to be identified -- in this paper, we propose new classes of Koopman models that have built-in guarantees of these properties. These models are guaranteed to be stable or stabilizable via a novel *direct parameterization approach* that leads to *unconstrained* optimization problems with respect to their parameter sets. To explore the representational flexibility of these model sets, we establish novel theoretical connections between the stability of discrete-time Koopman embedding and contraction-based forms of nonlinear stability and stabilizability. The proposed approach is illustrated in applications to stable nonlinear system identification and imitation learning via stabilizable models. Simulation results empirically show that the learning approaches based on the proposed models outperform prior methods lacking stability guarantees.
What problem does this paper attempt to address?
The paper addresses how to guarantee stability and stabilizability when a dynamical system model is learned from data. Most existing Koopman learning methods ignore these two pieces of prior knowledge about the system. The paper proposes new classes of Koopman models that are stable or stabilizable by construction: a novel direct parameterization makes the guarantee hold for every parameter value, so training becomes an unconstrained optimization problem over the parameter set. The main contributions are:
1. **New Parameterization of Stable Koopman Models**:
- A parameterization is proposed under which the set of stable Koopman models is unconstrained in its parameters, so automatic differentiation tools can be used efficiently for "plug-and-play" optimization.
2. **Equivalence of Koopman and Contraction Criteria for Discrete-Time Nonlinear Systems**:
- The Koopman stability criterion and the contraction criterion are proved equivalent for discrete-time nonlinear systems, extending the authors' previous work on continuous-time systems. This equivalence underpins a new Koopman learning framework that can represent all stable (contracting) autonomous systems.
3. **Extension to Generalized Feedback Linearizable Systems**:
- The proposed set of Koopman models is extended to generalized feedback linearizable systems, with a stabilizability constraint imposed on the model set at the same time. These results are applied to imitation learning, that is, learning a control policy from demonstration data, with regularization introduced to encourage closed-loop stability.
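One direction of the equivalence in item 2, from a stable Koopman embedding to contraction, can be sketched in a few lines. The symbols \( P \), \( \Phi \), \( F \), and \( M \) are notation introduced here for illustration only, and the paper's precise assumptions may differ.

\[
\begin{aligned}
&\phi(f(x)) = A\,\phi(x), \qquad A^\top P A - P \prec 0, \quad P \succ 0 \\
&\Longrightarrow\ \Phi(f(x))\,F(x) = A\,\Phi(x), \qquad \Phi := \frac{\partial \phi}{\partial x}, \quad F := \frac{\partial f}{\partial x} \\
&\Longrightarrow\ F(x)^\top M(f(x))\,F(x) - M(x) = \Phi(x)^\top \left( A^\top P A - P \right) \Phi(x) \prec 0, \qquad M(x) := \Phi(x)^\top P\,\Phi(x).
\end{aligned}
\]

The second line follows from differentiating the first with the chain rule. Provided \( \Phi(x) \) has full column rank, \( M(x) \) is positive definite and the last line is the discrete-time contraction condition with metric \( M \).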
### Main Motivation and Problem Setting of the Paper
The paper considers discrete-time nonlinear autonomous systems and control systems whose dynamics are assumed to be unknown. Given a data set \( E_D \) and a cost function \( J(E_D,\hat{f},\hat{g}) \), the paper poses two main problems:
1. **Learning Stable Autonomous Systems**:
- Consider a contracting system (i.e., the case where \( u\equiv0 \)). Learn an approximate model \( \hat{f} \) from the data set \( E_D \) such that the identified dynamics \( x_{t+1}=\hat{f}(x_t) \) are guaranteed to be contracting.
2. **Imitation Learning**:
- Consider a data set \( E_D \) generated by the asymptotically stabilizable system (5). Learn a static feedback \( u = \rho(x) \) that approximates the demonstration data while ensuring that the closed-loop system \( x_{t+1}=f(x_t)+g(x_t)\rho(x_t) \) is contracting.
### Main Theoretical Problems
The main theoretical question is how to parameterize these model sets so that their parameters are unconstrained. This matters in practice for the following two model classes:
- **Class of Stable Koopman Models**:
- A class of stable Koopman models covering all contracting systems is defined, and an equivalent parameterization of it is studied. Based on this, a new algorithm for learning stable Koopman embeddings is proposed.
- **Class of Stabilizable Koopman Models**:
- For generalized feedback linearizable systems, a class of stabilizable Koopman models is proposed and its properties are studied. These results are applied to imitation learning to ensure that the learned control policy yields a stable closed loop.
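To make the structure of a stable Koopman model concrete, here is a minimal NumPy sketch, not the paper's implementation. It assumes an embedding that stacks the state on top of nonlinear features, so a left inverse is simply a truncation; the names `make_embedding` and `rollout`, the `tanh` features, and the placeholder matrix `A` are all hypothetical choices made for illustration.

```python
import numpy as np


def make_embedding(n, m, seed=0):
    """Return (phi, phi_left_inverse) for an n-dim state lifted to n + m dims."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((m, n))  # stand-in for learned feature weights

    def phi(x):
        # Stack the state on top of nonlinear features.
        # tanh(W @ 0) = 0, so the origin is mapped to the origin.
        return np.concatenate([x, np.tanh(W @ x)])

    def phi_left_inverse(z):
        # The first n lifted coordinates are the state itself.
        return z[:n]

    return phi, phi_left_inverse


def rollout(phi, phi_left_inverse, A, x0, steps):
    """Lift x0 once, propagate linearly with A, and project each step back."""
    z = phi(x0)
    xs = [phi_left_inverse(z)]
    for _ in range(steps):
        z = A @ z
        xs.append(phi_left_inverse(z))
    return np.stack(xs)


n, m = 2, 3
phi, phi_L = make_embedding(n, m)
A = 0.9 * np.eye(n + m)  # placeholder Schur-stable matrix (assumption)
x0 = np.array([1.0, -0.5])
traj = rollout(phi, phi_L, A, x0, steps=50)
```

Because the lifted state evolves linearly under a Schur-stable `A`, it decays to zero, and the projected trajectory decays with it. In a learning setting `W` and `A` would be fitted to data rather than fixed as they are here.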
### Parameterization Method
To make the model set unconstrained in its parameters, the paper proposes the following parameterizations:
- **Parameterization of Stable Matrix \( A \)**:
- An unconstrained parameterization is used to make the matrix \( A \) Schur stable. It relies on a matrix decomposition that guarantees \( A \) satisfies a Lyapunov inequality.
- **Parameterization of Koopman Embedding \( \phi \)**:
- Neural networks or other differentiable function approximators are used to parameterize the Koopman embedding \( \phi \) in such a way that its left inverse \( \phi_L \) exists and is computable.
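As an illustration of what an unconstrained parameterization of a Schur-stable matrix can look like, the sketch below uses a simple similarity-transform construction. This is an assumption chosen for clarity and is not necessarily the decomposition used in the paper. Any pair of free matrices `W` and `L` yields \( A = S^{-1} C S \) with \( \|C\|_2 < 1 \) and \( S \succ 0 \), so \( A \) satisfies the Lyapunov inequality \( A^\top P A - P \prec 0 \) with \( P = S^2 \). The function name `stable_matrix` and the constant `eps` are hypothetical.

```python
import numpy as np


def stable_matrix(W, L, eps=1e-3):
    """Map two unconstrained square matrices to a Schur-stable matrix."""
    N = W.shape[0]
    # Scale W so its spectral norm is s / (1 + s) < 1.
    C = W / (1.0 + np.linalg.norm(W, 2))
    # Symmetric positive definite, hence invertible.
    S = L @ L.T + eps * np.eye(N)
    # A = S^{-1} C S has the same eigenvalues as C.
    return np.linalg.solve(S, C @ S)


rng = np.random.default_rng(0)
N = 5
A = stable_matrix(rng.standard_normal((N, N)), rng.standard_normal((N, N)))
spectral_radius = np.max(np.abs(np.linalg.eigvals(A)))
```

Since every value of `W` and `L` gives a stable `A`, a gradient-based optimizer can update them freely without projections or constraint handling, which is the practical point of a direct parameterization.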
### Learning Framework
Under the proposed parameterization of the model class, the data set \( E_D \) of the actual system is used to fit the parameter \( \theta \). Specifically, the most...