Abstract: We consider the optimal regulation problem for nonlinear control-affine dynamical systems. Whereas the linear-quadratic regulator (LQR) considers optimal control of a linear system with quadratic cost function, we study polynomial systems with polynomial cost functions; we call this problem the polynomial-polynomial regulator (PPR). The resulting polynomial feedback laws provide two potential improvements over linear feedback laws: 1) they more accurately approximate the optimal control law, resulting in lower control costs, and 2) for some problems they can provide a larger region of stabilization. We derive explicit formulas -- and a scalable, general purpose software implementation -- for computing the polynomial approximation to the value function that solves the optimal control problem. The method is illustrated first on a low-dimensional aircraft stall stabilization example, for which PPR control recovers the aircraft from more severe stall conditions than LQR control. Then we demonstrate the scalability of the approach on a semidiscretization of dimension $n=129$ of a partial differential equation, for which the PPR control reduces the control cost by approximately 75% compared to LQR for the initial condition of interest.

What problem does this paper attempt to address?

This paper addresses the optimal regulation problem for nonlinear control-affine dynamical systems. Specifically, it studies optimal control of polynomial systems with polynomial cost functions, which the authors call the polynomial-polynomial regulator (PPR). Unlike the linear-quadratic regulator (LQR), which assumes linear dynamics and a quadratic cost, PPR admits polynomial dynamics and polynomial costs. The resulting polynomial feedback laws offer two potential advantages:

1. **More accurate approximation of optimal control**: Compared with linear feedback laws, polynomial feedback laws can approximate the optimal control law more accurately, thus reducing the control cost.
2. **Larger region of stabilization**: For some problems, polynomial feedback laws can stabilize the system from a larger set of initial conditions.

To solve this problem, the authors derive explicit formulas and develop a scalable, general-purpose software implementation for computing polynomial approximations to the value function that solves the optimal control problem. The method is first illustrated on a low-dimensional aircraft stall stabilization example, where PPR control recovers the aircraft from more severe stall conditions than LQR control. The method then demonstrates its scalability on a semidiscretized partial differential equation (dimension $n = 129$), where PPR control reduces the control cost by approximately 75% compared with LQR for the initial condition of interest.

### Summary of Mathematical Formulas

1. **Dynamical system model**:
   \[ \dot{x}(t) = f(x(t)) + g(x(t))u(t) \]
   where $x(t)\in\mathbb{R}^n$ is the state, $u(t)\in\mathbb{R}^m$ is the input, $f:\mathbb{R}^n\rightarrow\mathbb{R}^n$ is the drift term, and $g:\mathbb{R}^n\rightarrow\mathbb{R}^{n\times m}$ is the input mapping.
2. **Cost function**:
   \[ J(x_0, u) := \frac{1}{2}\int_0^{\infty}\left(x^{\top}Q(x)x + u^{\top}R(x)u\right)dt \]
   where $Q(x)\succeq 0$ and $R(x)\succ 0$ are symmetric matrix-valued functions that are positive semidefinite and positive definite, respectively.
3. **Value function**:
   \[ V(x_0) := \min_u J(x_0, u) \]
4. **HJB PDE**:
   \[ 0 = \frac{\partial V^{\top}(x)}{\partial x}f(x) - \frac{1}{2}\frac{\partial V^{\top}(x)}{\partial x}g(x)R^{-1}(x)g^{\top}(x)\frac{\partial V(x)}{\partial x} + \frac{1}{2}x^{\top}Q(x)x \]
5. **Optimal control law**:
   \[ u^*(x) = -R^{-1}(x)g^{\top}(x)\frac{\partial V(x)}{\partial x} \]
6. **Value function approximation in Al'brekht's method**:
   \[ V(x) \approx \frac{1}{2}x^{\top}V_2x + \frac{1}{2}\sum_{i=3}^{d} v_i^{\top}x^{i} \]
   where $v_2 = \operatorname{vec}(V_2)$, $x^{i}$ is read as the $i$-fold Kronecker power of $x$ (so that each $v_i^{\top}x^{i}$ is a scalar), and $V_2$ is the solution of the algebraic Riccati equation:
   \[ 0 = A^{\top}V_2 + V_2A - V_2BR^{-1}B\cdots \]
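
To make the degree-by-degree structure of the formulas above concrete, the following is a minimal sketch for a scalar system only ($n = m = 1$). It is not the authors' implementation. It assumes a polynomial drift $f(x) = f_1 x + f_2 x^2 + \dots$, a constant input gain $b$, and constant weights $q$ and $r$; the function names `scalar_ppr_gradient`, `ppr_feedback`, and `hjb_residual` are hypothetical. The quadratic coefficient comes from the scalar Riccati equation $0 = 2 f_1 v_2 - (b^2/r) v_2^2 + q$, and each higher coefficient is obtained by requiring the next power of $x$ in the HJB equation to vanish, which is a linear equation in the one new unknown.

```python
import numpy as np
from numpy.polynomial import Polynomial


def scalar_ppr_gradient(f_coeffs, b, q, r, degree):
    """Coefficients p[k] of x^k in dV/dx for a degree-`degree` value function.

    Assumptions (illustrative, scalar case only):
      f(x) = f_coeffs[0]*x + f_coeffs[1]*x^2 + ...,  g(x) = b,  Q = q,  R = r,
      with len(f_coeffs) <= degree.
    With V(x) = 1/2 * sum_i v_i x^i, the coefficients are v_i = 2*p[i-1]/i.
    """
    f = np.zeros(degree + 1)
    f[1:1 + len(f_coeffs)] = f_coeffs          # f[k] multiplies x^k
    a = f[1]
    p = np.zeros(degree)                       # dV/dx has degree `degree - 1`
    # Stabilizing root of the scalar Riccati equation 0 = 2*a*v2 - (b^2/r)*v2^2 + q
    p[1] = r * (a + np.sqrt(a**2 + b**2 * q / r)) / b**2
    a_cl = a - b**2 * p[1] / r                 # closed-loop linear dynamics (negative)
    for k in range(2, degree):
        # Coefficient of x^(k+1) in the HJB equation, excluding terms with p[k]
        drift = sum(p[j] * f[k + 1 - j] for j in range(1, k))
        quad = sum(p[i] * p[k + 1 - i] for i in range(2, k))
        # The remaining terms are p[k] * a_cl, so solve the linear equation
        p[k] = -(drift - b**2 / (2 * r) * quad) / a_cl
    return p


def ppr_feedback(p, b, r, x):
    """Polynomial feedback u(x) = -(b/r) * dV/dx evaluated at x."""
    return -(b / r) * Polynomial(p)(x)


def hjb_residual(p, f_coeffs, b, q, r, degree):
    """Coefficients of x^0..x^degree of the HJB right-hand side (ideally all zero)."""
    P = Polynomial(p)
    F = Polynomial(np.concatenate(([0.0], f_coeffs)))
    res = P * F - (b**2 / (2 * r)) * P * P + Polynomial([0.0, 0.0, q / 2])
    coef = np.zeros(degree + 1)
    n = min(degree + 1, len(res.coef))
    coef[:n] = res.coef[:n]
    return coef


if __name__ == "__main__":
    # Hypothetical test problem: open-loop unstable drift f(x) = x + 0.5 x^2
    f_coeffs = [1.0, 0.5]
    p = scalar_ppr_gradient(f_coeffs, b=1.0, q=1.0, r=1.0, degree=6)
    print("dV/dx coefficients:", p)
    print("max HJB residual coefficient:",
          np.max(np.abs(hjb_residual(p, f_coeffs, 1.0, 1.0, 1.0, 6))))
    print("u(0.3) =", ppr_feedback(p, 1.0, 1.0, 0.3))
```

Truncating `p` after its linear term gives the LQR feedback for the linearized system, so the same routine can be used to compare linear and polynomial feedback on a toy problem. In the multivariate case each scalar division by `a_cl` becomes a structured linear system in the coefficient vector $v_i$; this sketch does not attempt that.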

Computing Solutions to the Polynomial-Polynomial Regulator Problem
