Research on Deep Reinforcement Learning Algorithm Based on Dynamic Fusion Target

Zhixiong XU,Lei CAO,Yongliang ZHANG,Xiliang CHEN,Chenxi LI
DOI: https://doi.org/10.3778/j.issn.1002-8331.1712-0280
2019-01-01
Abstract:Aiming at the problem of overestimation in deep reinforcement learning algorithm, a target dynamic fusion mechanism is proposed. Based on the Deep Q Networks(DQN)algorithm, an improvement is proposed to reduce the overestimation in DQN algorithm by incorporating the update target of Sarsa algorithm, while retaining the DQN algorithm to speed up the learning process, dynamically combining the respective advantages of the DQN algorithm and the Sarsa algorithm, the DTDQN(Dynamic Target Deep Q Network)algorithm is proposed. The experiment of Carteole control problem on OpenAI Gym with open platform is carried out. The results show that DTDQN can effectively reduce the overvalue of the function, and improve the learning performance and the training stability obviously.
What problem does this paper attempt to address?