Dissipativity in Infinite Horizon Optimal Control and Dynamic Programming

David Angeli,Lars Grüne
DOI: https://doi.org/10.1007/s00245-024-10103-y
2024-02-24
Applied Mathematics & Optimization
Abstract:In this paper we extend dynamic programming techniques to the study of discrete-time infinite horizon optimal control problems on compact control invariant sets with state-independent best asymptotic average cost. To this end we analyse the interplay of dissipativity and optimal control, and propose novel recursive approaches for the solution of so called shifted Bellman Equations.
mathematics, applied
What problem does this paper attempt to address?
Based on the provided text content, the main problems that this paper attempts to solve are as follows: 1. **Expand the application scope of dynamic programming techniques**: The paper aims to apply dynamic programming techniques to the study of discrete - time infinite - horizon optimal control problems, especially those with state - independent optimal asymptotic average cost on compact control - invariant sets. By analyzing the interaction between dissipativity and optimal control, new recursive methods are proposed to solve the so - called "shifted Bellman equation". 2. **Introduce terminal penalty**: Introduce terminal penalty in infinite - horizon optimal control, in the form of a suitable storage function with a negative sign. This helps to deal with the long - term average performance problem in infinite - horizon optimal control problems. 3. **Propose the shifted Bellman equation**: For the non - zero (but state - independent) optimal long - term average performance problem, a shifted Bellman equation is proposed. This method is applicable to systems with periodic, almost periodic or even chaotic operation modes, allowing for general time - varying asymptotic costs along the optimal solution. 4. **Propose new recursive methods**: Two new recursive methods are proposed, whose fixed points are the solutions of the shifted Bellman equation. These methods have good convergence properties under fairly general assumptions and can calculate the optimal average performance and the corresponding value function simultaneously. 5. **Handle the trade - off between transient cost and asymptotic average performance**: Study how to minimize the transient cost without affecting the asymptotic average performance, which is an unsolved problem in existing methods. In general, by introducing new mathematical tools and methods, the paper aims to expand the application scope of dynamic programming in infinite - horizon optimal control problems, especially dealing with complex systems that are difficult to solve by traditional methods.