Adaptive optimal control of continuous-time nonlinear affine systems via hybrid iteration
Omar Qasem, Weinan Gao, Kyriakos G. Vamvoudakis
DOI: https://doi.org/10.1016/j.automatica.2023.111261
IF: 6.4
2023-09-06
Automatica
Abstract: In this paper, a novel successive approximation framework, named hybrid iteration (HI), is proposed to bridge the performance gap between two well-known dynamic programming algorithms, namely policy iteration (PI) and value iteration (VI). Using HI, an approximate optimal control policy can be learned without the prior knowledge of an initial admissible control policy that PI requires. Additionally, the HI algorithm converges to the optimal solution much faster than VI, and thus requires far fewer learning iterations and much less CPU time than VI. We first develop a model-based HI algorithm, and then extend it to a data-driven HI algorithm that learns the optimal control policy without any knowledge of the system dynamics. Simulation results demonstrate the efficacy of the proposed HI algorithm.
automation & control systems; engineering, electrical & electronic
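
The two-phase idea in the abstract (start as VI so that no admissible policy is needed, then finish as PI for fast convergence) can be illustrated with a minimal model-based sketch. It is not the paper's algorithm. It assumes a scalar linear-quadratic special case, dx/dt = a*x + b*u with cost integral of q*x^2 + r*u^2, rather than the nonlinear affine setting. The VI update (a forward-Euler step on the Riccati equation), the switching rule (switch as soon as the induced gain stabilizes the loop), and the name `hybrid_iteration_scalar` are all assumptions made for illustration.

```python
import math


def hybrid_iteration_scalar(a, b, q, r, eps=0.1, tol=1e-10, max_iter=10_000):
    """Hypothetical hybrid VI -> PI sketch for a scalar LQR problem.

    Solves 2*a*p - (b**2 / r) * p**2 + q = 0 for the stabilizing p > 0.
    """
    p = 0.0  # VI may start from zero; no stabilizing policy is required
    vi_steps = 0

    # Phase 1 (VI-like): step along the Riccati residual until the induced
    # gain k = b*p/r makes the closed loop a - b*k strictly stable.
    while a - b * b * p / r >= 0.0:
        if vi_steps >= max_iter:
            raise RuntimeError("VI phase did not reach a stabilizing gain")
        p += eps * (2.0 * a * p - b * b * p * p / r + q)
        vi_steps += 1

    # Phase 2 (PI, Kleinman-style): policy evaluation by solving the scalar
    # Lyapunov equation 2*a_cl*p + q + r*k**2 = 0, then policy improvement.
    pi_steps = 0
    while pi_steps < max_iter:
        k = b * p / r              # current policy u = -k*x
        a_cl = a - b * k           # closed-loop pole, negative by construction
        p_next = (q + r * k * k) / (-2.0 * a_cl)
        pi_steps += 1
        converged = abs(p_next - p) < tol
        p = p_next
        if converged:
            break

    return p, vi_steps, pi_steps


# Open-loop unstable example (a > 0), so u = 0 is not an admissible policy.
p_star, n_vi, n_pi = hybrid_iteration_scalar(a=1.0, b=1.0, q=1.0, r=1.0)
print(p_star, n_vi, n_pi)
```

For a = b = q = r = 1 the Riccati equation reduces to 2p - p^2 + 1 = 0, whose stabilizing root is 1 + sqrt(2), so the returned value can be checked against `1 + math.sqrt(2)`. In this sketch the VI phase only has to reach a stabilizing gain, not converge, which is where the iteration savings over pure VI would come from.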