A Digital Receding-Horizon Learning Controller for Nonlinear Continuous-time Systems

Xinglong Zhang,Wenzhang Li,Xin Xu,Wei Jiang
DOI: https://doi.org/10.1016/j.ifacol.2020.12.2297
2020-01-01
IFAC-PapersOnLine
Abstract:Adaptive dynamic programming (ADP) has been recently studied to solve infinite-horizon optimal control problems of nonlinear continuoustime (CT) systems. In this paper, a receding-horizon actor-critic design (RH-ACD) method is proposed to solve the optimal control problem of nonlinear CT systems. In the proposed RH-ACD method, the recedinghorizon control strategy, which is originated from the idea of model predictive control (MPC). The actorcritic structure is designed to approximate the timedependent control policy and value function in each prediction horizon. The network weights of the actor and the critic are updated simultaneously online. The simulation results show that RH-ACD has improved control performance and reduced computational costs when compared with conventional MPC and infinitehorizon ADP.
What problem does this paper attempt to address?