Discrete-time optimal control scheme based on Q-learning algorithm

Qinglai Wei, Derong Liu, Ruizhuo Song
DOI: https://doi.org/10.1109/ICICIP.2016.7885888
2016-01-01
Abstract: This paper is concerned with optimal control problems of discrete-time nonlinear systems via a novel Q-learning algorithm. In the newly developed Q-learning algorithm, the iterative Q function is updated in each iteration over the whole state and control spaces, instead of at a single state and control pair. A new convergence criterion for the corresponding Q-learning algorithm is presented, in which the traditional constraints on the learning rates of Q-learning algorithms are relaxed. Finally, simulation results are provided to illustrate the performance of the developed algorithm.
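The whole-space update described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy system x_{k+1} = clip(x_k + u_k) on five integer states, the utility U(x, u) = x^2 + u^2, the constant learning rate alpha = 0.5, and the standard relaxed update Q_{i+1}(x, u) = (1 - alpha) Q_i(x, u) + alpha [U(x, u) + min_{u'} Q_i(x_{k+1}, u')]. The point is only that every (state, control) pair is refreshed in each iteration, rather than the single pair visited along a trajectory.

```python
def sweep_q_learning(n_states=5, controls=(-1, 0, 1), alpha=0.5, n_iters=200):
    """Full-sweep Q iteration on a toy deterministic system (illustrative only)."""

    def step(x, u):
        # Assumed dynamics: x_{k+1} = x_k + u_k, clipped to the state grid.
        return min(max(x + u, 0), n_states - 1)

    def utility(x, u):
        # Assumed utility: quadratic in state and control, zero at the origin.
        return x * x + u * u

    # Q_0 is identically zero; rows are states, columns are controls.
    q = [[0.0 for _ in controls] for _ in range(n_states)]
    for _ in range(n_iters):
        # min over u' of Q_i(x, u') for every state, taken from the old iterate.
        best = [min(row) for row in q]
        # Update ALL (x, u) pairs at once, not one visited pair.
        q = [[(1 - alpha) * q[x][j] + alpha * (utility(x, u) + best[step(x, u)])
              for j, u in enumerate(controls)]
             for x in range(n_states)]
    return q


q = sweep_q_learning()
values = [min(row) for row in q]                          # approximate optimal cost per state
policy = [(-1, 0, 1)[row.index(min(row))] for row in q]   # greedy control per state
```

In this toy problem the greedy policy drives the state toward the origin one step at a time, and a constant learning rate is sufficient because each sweep covers the whole grid; whether this matches the convergence criterion in the paper is not something the abstract specifies.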