Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems.

Wentao Guo, Jennie Si, Feng Liu, Shengwei Mei
DOI: https://doi.org/10.1109/TNNLS.2017.2702566
IF: 14.255
2018-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract: Policy iteration approximate dynamic programming (DP) is an important algorithm for solving optimal decision and control problems. In this paper, we focus on the problem associated with policy approximation in policy iteration approximate DP for discrete-time nonlinear systems using infinite-horizon undiscounted value functions. Taking policy approximation error into account, we demonstrate asympt...