Discrete-Time Nonlinear Generalized Policy Iteration for Optimal Control Using Neural Networks

Qinglai Wei,Derong Liu,Xiong Yang
DOI: https://doi.org/10.1007/978-3-319-12637-1_49
2014-01-01
Abstract:In this paper, a new generalized policy iteration (GPI) based adaptive dynamic programming (ADP) algorithm is developed to solve optimal control problems for infinite horizon discrete-time nonlinear systems. The GPI algorithm is a general idea of interacting policy and value iteration algorithms of ADP. There are two iteration indices, which iterate for policy improvement and policy evaluation, respectively, in the GPI algorithm. The convergence properties of the GPI algorithm are developed. Finally, simulation results are presented to illustrate the performance of the developed algorithm.
What problem does this paper attempt to address?