Improved Deep Deterministic Policy Gradient Algorithm Based on Prioritized Sampling

HaoYu Zhang,Kai Xiong,Jie Bai
DOI: https://doi.org/10.1007/978-981-13-2288-4_21
2018-01-01
Abstract:Deep reinforcement learning tends to have low sampling efficiency, and prioritized sampling algorithm can improve the sampling efficiency to a certain extent. The prioritized sampling algorithm can be used in deep deterministic policy gradient algorithm, and a small sample sorting method is proposed to solve the problem of high complexity of the common prioritized sampling algorithm. Simulation experiments prove that the improved deep deterministic policy gradient algorithm improves the sampling efficiency and the training performance is better.
What problem does this paper attempt to address?