Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors.

Qinglai Wei, Benkai Li, Ruizhuo Song
DOI: https://doi.org/10.1109/TNNLS.2017.2661865
IF: 14.255
2018-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract: In this paper, a generalized policy iteration (GPI) algorithm with approximation errors is developed for solving infinite-horizon optimal control problems for nonlinear systems. The developed stable GPI algorithm provides a general structure of discrete-time iterative adaptive dynamic programming algorithms, by which most of the discrete-time reinforcement learning algorithms can be described usin...