Computing Solutions to the Polynomial-Polynomial Regulator Problem

Nicholas A. Corbin,Boris Kramer
2024-10-30
Abstract:We consider the optimal regulation problem for nonlinear control-affine dynamical systems. Whereas the linear-quadratic regulator (LQR) considers optimal control of a linear system with quadratic cost function, we study polynomial systems with polynomial cost functions; we call this problem the polynomial-polynomial regulator (PPR). The resulting polynomial feedback laws provide two potential improvements over linear feedback laws: 1) they more accurately approximate the optimal control law, resulting in lower control costs, and 2) for some problems they can provide a larger region of stabilization. We derive explicit formulas -- and a scalable, general purpose software implementation -- for computing the polynomial approximation to the value function that solves the optimal control problem. The method is illustrated first on a low-dimensional aircraft stall stabilization example, for which PPR control recovers the aircraft from more severe stall conditions than LQR control. Then we demonstrate the scalability of the approach on a semidiscretization of dimension $n=129$ of a partial differential equation, for which the PPR control reduces the control cost by approximately 75% compared to LQR for the initial condition of interest.
Optimization and Control
What problem does this paper attempt to address?
This paper aims to solve the optimal regulation problem of nonlinear control - affine dynamical systems. Specifically, it studies the optimal control problem of polynomial systems with polynomial cost functions (Polynomial - Polynomial Regulator, PPR). Different from the Linear - Quadratic Regulator (LQR), PPR considers nonlinear systems and polynomial cost functions, which offer two potential advantages: 1. **More accurate approximation of optimal control**: Compared with linear feedback laws, polynomial feedback laws can approximate the optimal control law more accurately, thus reducing the control cost. 2. **Larger stability region**: For some problems, polynomial feedback laws can provide a larger stability region. To solve this problem, the author derives explicit formulas and develops a scalable general - purpose software implementation for computing polynomial value function approximations that solve the optimal control problem. This method was first verified in a low - dimensional aircraft stall stabilization example, and the results show that PPR control can recover the aircraft from more severe stall conditions than LQR control. Subsequently, the method demonstrated its scalability on a semi - discretized model of partial differential equations (dimension \(n = 129\)), and the results show that PPR control reduces the control cost by approximately 75% under the initial conditions of interest. ### Summary of Mathematical Formulas 1. **Dynamic system model**: \[ \dot{x}(t)=f(x(t)) + g(x(t))u(t) \] where \(x(t)\in\mathbb{R}^n\) is the state, \(u(t)\in\mathbb{R}^m\) is the input, \(f:\mathbb{R}^n\rightarrow\mathbb{R}^n\) is the drift term, and \(g:\mathbb{R}^n\rightarrow\mathbb{R}^{n\times m}\) is the input mapping. 2. **Cost function**: \[ J(x_0, u):=\frac{1}{2}\int_0^{\infty}(x^{\top}Q(x)x + u^{\top}R(x)u)dt \] where \(Q(x)\succeq0\) and \(R(x)\succ0\) are non - negative definite and positive definite symmetric matrix - valued functions, respectively. 3. **Value function**: \[ V(x_0):=\min_u J(x_0, u) \] 4. **HJB PDE**: \[ 0=\frac{\partial V^{\top}(x)}{\partial x}f(x)-\frac{1}{2}\frac{\partial V^{\top}(x)}{\partial x}g(x)R^{-1}(x)g^{\top}(x)\frac{\partial V(x)}{\partial x}+\frac{1}{2}x^{\top}Q(x)x \] 5. **Optimal control law**: \[ u^*(x)=-R^{-1}(x)g^{\top}(x)\frac{\partial V(x)}{\partial x} \] 6. **Value function approximation in Al’brekht method**: \[ V(x)\approx\frac{1}{2}x^{\top}V_2x+\frac{1}{2}\sum_{i = 3}^d v_i^{\top}x^i \] where \(v_2=\text{vec}(V_2)\), and \(V_2\) is the solution of the algebraic Riccati equation: \[ 0 = A^{\top}V_2+V_2A - V_2BR^{-1}B