Model-based Credit Assignment for Model-free Deep Reinforcement Learning

Dong Yan, Jiayi Weng, Shiyu Huang, Chongxuan Li, Yichi Zhou, Hang Su, Jun Zhu
2020-01-01
Abstract: Combining model-free reinforcement learning with deep neural networks has brought significant progress, enabling deep reinforcement learning to solve many real-world problems. However, its performance is not ideal on complex tasks, and its computation cost is usually high. These shortcomings limit its further application. In this paper, we propose a novel framework that uses a model-based method to assign credit to hundreds of thousands of non-terminal state-action pairs in order to overcome these drawbacks. Specifically, we first abstract the states and actions of the original problem into a compact representation, which reduces the problem to a tractable size. Then, we solve the abstract problem to obtain the optimal value function, i.e., the expected return from future rewards. Finally, we use the derived value function to assign credit to state-action pairs of the original problem. We conduct extensive experiments on three different scenarios, from small to large and from single-task to multi-task. The experimental results demonstrate that our agent outperforms previous state-of-the-art methods by a large margin in both final performance and training efficiency. Based on this framework, we trained an agent to participate in an online video game competition and achieved 2nd place in the final.
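The three steps in the abstract (abstract, solve, assign credit) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the three-region chain problem, the mappings phi and PSI, the discount factor, and the choice to use the abstract Q-value as the credit signal. The abstract problem is solved with plain tabular value iteration.

```python
import numpy as np

GAMMA = 0.9  # assumed discount factor

# Hypothetical abstract problem: 3 regions in a chain; region 2 is an absorbing goal.
# Abstract actions: 0 = stay in the region, 1 = advance to the next region.
P = np.zeros((2, 3, 3))            # P[a, s, s'] = transition probability
P[0] = np.eye(3)                   # "stay" keeps the current region
P[1, 0, 1] = P[1, 1, 2] = P[1, 2, 2] = 1.0
R = np.zeros((2, 3))               # R[a, s] = expected immediate reward
R[1, 1] = 1.0                      # reward for entering the goal region


def solve_abstract(P, R, gamma=GAMMA, tol=1e-10):
    """Tabular value iteration on the small abstract problem."""
    V = np.zeros(P.shape[1])
    while True:
        Q = R + gamma * (P @ V)    # shape (actions, states)
        V_new = Q.max(axis=0)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new, R + gamma * (P @ V_new)
        V = V_new


def phi(state):
    """Hypothetical state abstraction: 9 original positions -> 3 regions."""
    return state // 3


# Hypothetical action abstraction: 4 primitive actions -> 2 abstract actions.
PSI = {0: 0, 1: 0, 2: 1, 3: 1}


def credit(state, action, Q_abs):
    """Credit for an original (state, action) pair, read off the abstract Q-values."""
    return Q_abs[PSI[action], phi(state)]


V_abs, Q_abs = solve_abstract(P, R)
print(V_abs)                 # optimal values of the three abstract states
print(credit(4, 2, Q_abs))   # credit for a primitive "advance" action in region 1
```

In a setup like this, the credit values could serve as dense training targets or shaping signals for a model-free learner on the original problem; how the paper actually combines them with the model-free update is not specified in the abstract.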