Discrete-time Generalized Policy Iteration ADP Algorithm with Approximation Errors.

Qinglai Wei, Benkai Li, Ruizhuo Song
DOI: https://doi.org/10.1109/ssci.2017.8285276
2017-01-01
Abstract: This paper is concerned with a novel generalized policy iteration (GPI) algorithm in which approximation errors are explicitly considered. The properties of the stable GPI algorithm with approximation errors are analyzed. Convergence of the developed algorithm is established by showing that the iterative value function converges to a finite neighborhood of the optimal performance index function. Finally, numerical examples and comparisons are presented.