An Approximate Quadratic Programming for Efficient Bellman Equation Solution

Jianmei Su,Hong Cheng,Hongliang Guo,Rui Huang,Zhinan Peng
DOI: https://doi.org/10.1109/access.2019.2939161
IF: 3.9
2019-01-01
IEEE Access
Abstract:This paper proposes an efficient algorithm which relies on quadratic programming for approximately solving the Bellman equation in reinforcement learning problem and guarantees to return optimal decision parameters. Through further applying universal approximation and fixed cardinality minimization techniques, the proposed algorithm in one hand expands the representation ability of basic linear value functions, on the other hand, it guarantees the convergence of the Bellman error. Experimental results on two canonical reinforcement learning scenarios demonstrate that the proposed algorithm achieves similar or better performance than the state-of-the-art algorithms, while reduces the computation time significantly and improves the robustness of the algorithm against state uncertainty.
What problem does this paper attempt to address?