Prioritized Experience Replay for Multi-agent Cooperation

Zirong HUANG,Yansong NING,Li WANG
DOI: https://doi.org/10.16355/j.cnki.issn1007-9432tyut.2021.05.008
2021-01-01
Abstract:In order to build a good experience replay buffer for multi-agent deep reinforcement learning, a prioritized experience replay algorithm was proposed for cooperative multi-agent learning. This algorithm introduces the idea of prioritized experience replay into the MAAC algorithm. During the training stage, the algorithm marks the importance of experience data based on the proportional prioritization calculated by the TD error, then uses the higher priority experience data to update the network. Experimental results show that the algorithm in this paper improves the quality of training data, thereby improving the speed of model convergence and learning efficiency. And the performance of the algorithm in the the cooperative treasure hunt and rover-tower environments is better than that of baseline algorithm.
What problem does this paper attempt to address?