Direct Heuristic Dynamic Programming with Augmented States

Jian Sun,Feng Liu,Jennie Si,Shengwei Mei
DOI: https://doi.org/10.1109/ijcnn.2011.6033633
IF: 7.8
2011-01-01
Neural Networks
Abstract:This paper addresses a design issue of an approximate dynamic programming structure and its respective convergence property. Specifically, we propose to impose a PID structure to the action and critic networks in the direct heuristic dynamic programming (direct HDP) online learning controller. We demonstrate that the direct HDP with such PID augmented states improves convergence speed and that it out performs the traditional PID even though the learning controller may be initialized to be like a PID. Also for the first time, by using a Lyapnov approach we show that the action and critic network weights retain the property of uniformly ultimate boundedness (UUB) under mild conditions.
What problem does this paper attempt to address?