The Optimality Analysis of Hybrid Reinforcement Learning Combined with SVMs

Xue-ning Wang,Wei Chen,Da-xue Liu,Tao Wu,Han-gen He
DOI: https://doi.org/10.1109/isda.2006.268
2006-01-01
Abstract:To reduce the learning time of reinforcement learning (RL), hybrid algorithms that combines reinforcement learning with various supervised learning methods have attracted many research interests. However, the global convergence and optimality become one of the main problems for hybrid reinforcement learning algorithms. In this paper, the convergence of a hybrid RL algorithm, which is combined with support vector machines(SVMs)is analyzed theoretically. It is shown that by making use of policy gradient learning and the SVM regression, the hybrid algorithm can easily escape from local optima.
What problem does this paper attempt to address?