Off-policy integral reinforcement learning optimal tracking control for continuous-time chaotic systems

Wei Qinglai, Song Ruizhuo, Sun Qiuye, Xiao Wen-Dong
DOI: https://doi.org/10.1088/1674-1056/24/9/090504
2015-01-01
Chinese Physics B
Abstract: This paper presents an off-policy integral reinforcement learning (IRL) algorithm to obtain the optimal tracking control of unknown chaotic systems. Off-policy IRL can learn the solution of the Hamilton-Jacobi-Bellman (HJB) equation from system data generated by an arbitrary control. Moreover, off-policy IRL can be regarded as a direct learning method, which avoids identifying the system dynamics. The performance index function is first defined in terms of the system tracking error and the control error. An off-policy IRL algorithm is then proposed to solve the HJB equation. It is proven that the iterative control makes the tracking error system asymptotically stable and that the iterative performance index function converges. A simulation study demonstrates the effectiveness of the developed tracking control method.
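The abstract gives no formulas, so the following is only a sketch of the standard formulation that such a setup usually takes, not the paper's own notation. It assumes tracking-error dynamics affine in the control error, \dot e = f(e) + g(e) u_e, with assumed weighting matrices Q and R (both positive definite), an assumed behavior control u that generates the data, and an assumed integration interval T. All symbols here are illustrative.

```latex
% Performance index built from the tracking error e and control error u_e
J(e(t)) = \int_t^{\infty} \left( e^{\top} Q e + u_e^{\top} R u_e \right) d\tau

% HJB equation for the optimal performance index J^*
0 = \min_{u_e} \left[ e^{\top} Q e + u_e^{\top} R u_e
      + (\nabla J^*)^{\top} \left( f(e) + g(e) u_e \right) \right]

% Minimizing control error (from setting the gradient in u_e to zero)
u_e^* = -\tfrac{1}{2} R^{-1} g(e)^{\top} \nabla J^*

% Off-policy IRL relation along data generated by an arbitrary control u,
% for iterate i with policy u_e^{i} and improved policy u_e^{i+1}
J^{i}(e(t+T)) - J^{i}(e(t))
  = -\int_t^{t+T} \left( e^{\top} Q e + (u_e^{i})^{\top} R u_e^{i} \right) d\tau
    - 2 \int_t^{t+T} (u_e^{i+1})^{\top} R \left( u - u_e^{i} \right) d\tau
```

In this sketch the last relation contains neither f nor g explicitly, which is the sense in which an off-policy IRL scheme can avoid identifying the system dynamics: J^{i} and u_e^{i+1} are fitted from measured trajectories alone.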