Trustable Policy Collaboration Scheme for Multi-Agent Stigmergic Reinforcement Learning.

Xing Xu, Rongpeng Li, Zhifeng Zhao, Honggang Zhang
DOI: https://doi.org/10.1109/lcomm.2022.3144451
IF: 3.5529
2022-01-01
IEEE Communications Letters
Abstract: In this letter, we propose a trustable policy collaboration scheme in the paradigm of multi-agent independent reinforcement learning (MAIRL). This trustable policy collaboration scheme is realized by directly mixing the policy parameters of homogeneous agents, for which an upper bound of the mixture metric is derived to guarantee the policy improvement. This trustable policy collaboration scheme can update the behavioral policies of agents distributedly and further improve the performance of MAIRL. In addition, we develop a practical implementation of this trustable policy collaboration scheme, and verify its effectiveness in a mixed-autonomy traffic control simulation scenario through the performance comparison with other typical methods.
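The parameter mixing described in the abstract can be illustrated with a minimal sketch. The abstract gives neither the mixing rule nor the derived bound, so everything specific here is an assumption made for illustration: the convex-combination form, the function name `mix_policy_parameters`, the mixing weight `alpha`, and the cap `alpha_max` that stands in for the paper's upper bound on the mixture metric.

```python
import numpy as np


def mix_policy_parameters(theta_self, theta_peer, alpha, alpha_max):
    """Blend an agent's policy parameters with a homogeneous peer's.

    Hypothetical sketch: the convex combination and the cap `alpha_max`
    are assumptions, not the rule or bound derived in the letter.
    """
    if not 0.0 <= alpha_max <= 1.0:
        raise ValueError("alpha_max must lie in [0, 1]")
    # Clip the requested weight so the mixing step never exceeds the cap.
    alpha = min(max(alpha, 0.0), alpha_max)
    return (1.0 - alpha) * theta_self + alpha * theta_peer


# Two agents with the same parameter shape (homogeneous policies).
theta_a = np.array([0.0, 2.0])
theta_b = np.array([1.0, 0.0])

# The requested weight 0.9 is clipped to the cap 0.25.
mixed = mix_policy_parameters(theta_a, theta_b, alpha=0.9, alpha_max=0.25)
```

Here each agent would apply the update locally using a peer's parameters, which matches the distributed flavor the abstract describes; how the actual bound is computed is left to the letter itself.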