Clustering experience replay for the effective exploitation in reinforcement learning

Min Li, Tianyi Huang, William Zhu
DOI: https://doi.org/10.1016/j.patcog.2022.108875
IF: 8
2022-01-01
Pattern Recognition
Abstract:
• The limitation of the exploitation efficiency in existing reinforcement learning methods is analyzed in detail.
• Clustering is combined into the experience replay by a divide-and-conquer framework to improve the exploitation efficiency.
• Our experience replay can sufficiently replay all kinds of transitions in the current training with low time consumption.
• A new reinforcement learning method is proposed to implement our experience replay.
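The idea of replaying "all kinds of transitions" through clustering can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not state which clustering method, which features, or which sampling rule the paper uses. The sketch assumes plain k-means on the state vector and an equal sampling quota per cluster, and the names `kmeans` and `cluster_stratified_sample` are hypothetical.

```python
import random
from collections import defaultdict


def kmeans(points, k, iters=10, rng=None):
    """Plain Lloyd's k-means on lists of floats; returns (centroids, labels)."""
    rng = rng or random.Random(0)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        for i, p in enumerate(points):
            labels[i] = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
        # Update step: mean of the members; an empty cluster keeps its centroid.
        for c in range(k):
            members = [points[i] for i in range(len(points)) if labels[i] == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, labels


def cluster_stratified_sample(transitions, k, batch_size, rng=None):
    """Group transitions by state cluster, then draw equally from each group.

    Each transition is assumed to be a tuple (state, action, reward, next_state)
    whose state is a list of floats. Returns (batch, groups).
    """
    rng = rng or random.Random(0)
    states = [t[0] for t in transitions]
    _, labels = kmeans(states, k, rng=rng)

    clusters = defaultdict(list)
    for transition, label in zip(transitions, labels):
        clusters[label].append(transition)
    groups = list(clusters.values())

    # Equal quota per non-empty cluster (an assumption of this sketch),
    # drawn with replacement so that small clusters can still fill their quota.
    per_cluster = max(1, batch_size // len(groups))
    batch = []
    for group in groups:
        batch.extend(rng.choices(group, k=per_cluster))
    return batch, groups
```

With uniform sampling, a kind of transition that makes up 10% of the buffer fills about 10% of each batch. Under the equal-quota assumption above, it fills the same share as every other cluster, which is one simple way to read the divide-and-conquer framing.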