A Novel Reinforcement Learning Control for a Class of Strict-feedback Discrete-time Systems Via Multi-Gradient Recursive.

Weiwei Bai,Tieshan Li,Yue Long
DOI: https://doi.org/10.1109/spac53836.2021.9540001
2021-01-01
Abstract:This paper investigates the reinforcement learning control design problem via multi-gradient recursive (MGR) for a general class of strict feedback systems. The system with unknown control gain is studied, which is more general than the one with known control gain. The long-term strategic utility function is estimated by constructing a critic neural network (NN), and an actuator NN is constructed to estimate the unknown function in the controller. A novel learning strategy, the socalled MGR, is proposed to learning the weight vector, which can not only eliminate the local optimal problem that inherent in the gradient descent method but also improve the rate of convergency. According to the Lyapunov theory, all signals in the control system are ensured to be semiglobal uniformly ultimately bounded (SGUUB). At last, the comparison simulation examples are given to validate this strategy.
What problem does this paper attempt to address?