Phased Continuous Exploration Method for Cooperative Multi-Agent Reinforcement Learning

Jie Kang,Yaqing Hou,Yifeng Zeng,Yongchao Chen,Xiangrong Tong,Xin Xu,Qiang Zhang
DOI: https://doi.org/10.1109/cai59869.2024.00196
2024-01-01
Abstract:In multi-agent reinforcement learning, achieving effective exploration for agents remains challenging due to the non-stationarity of the environment and discrepancies between local and global information. In this paper, we propose a curiosity-driven phased continuous exploration method, termed PCE. We recognize that agents in different learning phases possess distinct knowledge and policies, allowing them to learn diverse knowledge and experiences from the same states. Therefore, we divide the training process of agents into different phases, employing a curiosity-driven method to explore independently within each phase. Simultaneously, addressing the characteristic of inconsistent local and global information in multi-agent systems, we strike a balance between exploration from local and global perspectives. Finally, we evaluate the proposed method in the popular multi-agent test task, StarCraft II. The results indicate that the method excels in enhancing the exploration capabilities of agents.
What problem does this paper attempt to address?