Learning-based adaptive optimal control of linear time-delay systems: A value iteration approach

Leilei Cui,Bo Pang,Miroslav Krstić,Zhong-Ping Jiang
DOI: https://doi.org/10.1016/j.automatica.2024.111944
IF: 6.4
2024-10-05
Automatica
Abstract:This paper proposes a novel learning-based adaptive optimal controller design method for a class of continuous-time linear time-delay systems. A key strategy is to exploit the state-of-the-art reinforcement learning (RL) techniques and adaptive dynamic programming (ADP), and propose a data-driven method to learn the near-optimal controller without the precise knowledge of system dynamics. Specifically, a value iteration (VI) algorithm is proposed to solve the infinite-dimensional Riccati equation for the linear quadratic optimal control problem of time-delay systems using finite samples of input-state trajectory data. It is rigorously proved that the proposed VI algorithm converges to the near-optimal solution. Compared with the previous literature, the nice features of the proposed VI algorithm are that it is directly developed for continuous-time systems without discretization and an initial admissible controller is not required for implementing the algorithm. The efficacy of the proposed methodology is demonstrated by two practical examples of metal cutting and autonomous driving.
automation & control systems,engineering, electrical & electronic
What problem does this paper attempt to address?