On policy iteration‐based discounted optimal control

Botao Dong, Longyang Huang, Xiwen Ma, Weidong Zhang
DOI: https://doi.org/10.1002/rnc.7245
IF: 3.8973
2024-02-17
International Journal of Robust and Nonlinear Control
Abstract: This article investigates the policy iteration (PI) method for the discounted optimal control (DOC) problem of continuous‐time linear systems. We establish the properties and convergence of the PI method. The theoretical analysis shows that the convergence of PI can be ensured without requiring an initial admissible control gain. The convergence rate of the PI method is also provided. An iteration‐termination criterion is established for detecting the stability of the closed‐loop system under the control gain obtained by executing PI. Two data‐driven implementations are constructed that do not use prior information about the system dynamics. A simulation example is presented to validate the properties of the PI method.
automation & control systems; engineering, electrical & electronic; mathematics, applied
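
To make the PI scheme summarized in the abstract concrete, the following is a minimal model-based sketch. It is not the paper's algorithm or its data-driven implementation. It assumes the standard discounted cost ∫ e^(-γt) (xᵀQx + uᵀRu) dt for dx/dt = Ax + Bu, uses the well-known equivalence with an undiscounted problem on the shifted matrix A - (γ/2)I, and applies a Kleinman-style iteration (policy evaluation by a Lyapunov equation, then gain update). The function names `solve_lyapunov` and `discounted_pi`, the example matrices, and the value of `gamma` are all hypothetical choices made for illustration.

```python
import numpy as np

def solve_lyapunov(Ac, M):
    """Solve Ac.T @ P + P @ Ac + M = 0 for P via Kronecker vectorization."""
    n = Ac.shape[0]
    I = np.eye(n)
    # Row-major vec: vec(Ac.T P) = kron(Ac.T, I) vec(P), vec(P Ac) = kron(I, Ac.T) vec(P)
    L = np.kron(Ac.T, I) + np.kron(I, Ac.T)
    P = np.linalg.solve(L, -M.reshape(-1)).reshape(n, n)
    return 0.5 * (P + P.T)  # symmetrize against round-off

def discounted_pi(A, B, Q, R, gamma, K0, iters=50):
    """Kleinman-style policy iteration on the shifted system A - (gamma/2) I.

    Assumption: A - (gamma/2) I - B @ K0 is Hurwitz, so each Lyapunov
    equation below has a unique solution.
    """
    A_g = A - 0.5 * gamma * np.eye(A.shape[0])
    K = K0
    for _ in range(iters):
        Ac = A_g - B @ K
        P = solve_lyapunov(Ac, Q + K.T @ R @ K)  # policy evaluation
        K = np.linalg.solve(R, B.T @ P)          # policy improvement
    return P, K, A_g

# Toy example (hypothetical values): a double integrator.
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
gamma = 0.1
K0 = np.zeros((1, 2))

P, K, A_g = discounted_pi(A, B, Q, R, gamma, K0)

# Residual of the discounted algebraic Riccati equation at the final P.
riccati = A_g.T @ P + P @ A_g - P @ B @ np.linalg.solve(R, B.T @ P) + Q
residual = np.linalg.norm(riccati)
shifted_eigs = np.linalg.eigvals(A_g - B @ K)
print("K =", K, "residual =", residual)
```

In this toy case the zero gain is a usable starting point only because the shifted matrix A - (γ/2)I happens to be Hurwitz, even though A itself is not. Whether this matches the relaxed initialization condition analyzed in the paper is not something the abstract states.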