A Summary on Some Typical Adaptive Dynamic Programming Schemes

Qian Zhao,Chaoxu Mu,Weiqiang Liu
DOI: https://doi.org/10.23919/chicc.2018.8483984
2018-01-01
Abstract:This paper sums up four typical schemes of adaptive dynamic programming (ADP). The diagrams are provided and the algorithms of various schemes are described, which is convenient for comparison. Some schemes in this paper belong to the group of action-dependent (AD) adaptive critic designs, which features without a model network in the design. For simplicity of notation, we do not use the prefix AD. The learning process of ADP is accomplished by updating the weights of the networks. The weight updating processes of some networks in GDHP scheme are introduced.
What problem does this paper attempt to address?