A Novel On-Line VI-ADP for Nonlinear Discrete-Time Systems

Chun Li, Jinliang Ding, Changxin Liu, Frank L. Lewis
DOI: https://doi.org/10.1109/ICCA.2019.8899631
2019-01-01
Abstract: In this paper, a novel on-line value iteration (VI) adaptive (or approximate) dynamic programming (ADP) algorithm is developed to solve optimal control problems for nonlinear discrete-time affine systems. The algorithm updates the value function and the policy function at each step as the system runs. Its key idea is to combine the local VI-ADP algorithm with a Gaussian distribution whose expectation is the current state. To implement the on-line VI-ADP algorithm, an actor-critic structure composed of neural networks approximates the value function and the policy function. In addition, a model neural network replaces the system dynamics in a data-driven way, so the algorithm remains feasible without knowledge of the system dynamics. Finally, simulations show that the on-line VI-ADP algorithm can solve regulation and tracking problems effectively.
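The idea of updating the value function at every running step, using states sampled from a Gaussian centered on the current state, can be illustrated with a deliberately simplified sketch. It is not the paper's implementation. It assumes a scalar linear system x_{k+1} = a*x_k + b*u_k with stage cost q*x^2 + r*u^2, a known model in place of the model network, and a quadratic value function V(x) = p*x^2 fitted by least squares in place of the critic and actor networks. All names and parameter values (`vi_step`, `online_vi`, `a`, `b`, `q`, `r`, `sigma`) are hypothetical.

```python
import random


def greedy_gain(p, a, b, r):
    """Feedback gain K of the greedy control u = -K * x for V(x) = p * x**2."""
    return (a * b * p) / (r + b * b * p)


def vi_step(p, a, b, q, r, samples):
    """One value-iteration update of p, fitted on locally sampled states."""
    k_gain = greedy_gain(p, a, b, r)
    num = 0.0
    den = 0.0
    for x in samples:
        u = -k_gain * x
        x_next = a * x + b * u
        # VI target: stage cost plus current value estimate at the next state.
        target = q * x * x + r * u * u + p * x_next * x_next
        # Least-squares fit of target ~ p_new * x**2 over the samples.
        num += target * x * x
        den += x ** 4
    return num / den


def online_vi(a=0.9, b=0.5, q=1.0, r=1.0, x0=2.0,
              steps=200, n_samples=20, sigma=0.1, seed=0):
    """Run the system and update the value estimate once per time step."""
    rng = random.Random(seed)
    p, x = 0.0, x0  # p = 0 is the usual VI initialization V_0 = 0
    for _ in range(steps):
        # Sample states from a Gaussian whose mean is the current state.
        samples = [rng.gauss(x, sigma) for _ in range(n_samples)]
        p = vi_step(p, a, b, q, r, samples)
        # Apply the greedy control for the updated value estimate.
        x = a * x - b * greedy_gain(p, a, b, r) * x
    return p, x


if __name__ == "__main__":
    p_final, x_final = online_vi()
    print("value weight p:", p_final)
    print("final state x:", x_final)
```

In this linear-quadratic special case the fitted weight follows the scalar Riccati recursion, so `p` settles at the fixed point of p = q + a^2 * p * r / (r + b^2 * p) while the state is regulated toward zero. With a nonlinear system the quadratic fit would only be a local approximation around the current state, which is the role the neural networks play in the abstract.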