Partial Communication Model Based on the Gain of Q-value in Multi-agent Reinforcement Learning

Jie Xu, Wei, Ya Zhang, Peng Cui
2024-01-01
Abstract: Communication is crucial in multi-agent reinforcement learning (MARL), especially in scenarios where a large number of agents work collaboratively. Effective communication helps mitigate the challenges of collaboration and avoid strategy overfitting. However, existing works primarily focus on broadcast communication, which introduces information redundancy and efficiency loss. To address these difficulties, we propose a Partial Communication Model based on the Gain of Q-value (PCGQ) that enables agents to communicate efficiently in a partially observable distributed environment to enhance cooperation. PCGQ uses a communication module that calculates the gain in the average Q-value generated by communication to determine whether to communicate with other agents in the observable field. PCGQ is broadly compatible with frameworks that follow the Centralized Training with Decentralized Execution (CTDE) paradigm and incorporate a joint action-value function. We conducted experiments in cooperative navigation and joint combat confrontation environments, which show that PCGQ significantly enhances performance, mitigates unnecessary information loss across diverse multi-agent cooperation scenarios, and reduces communication costs.
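The gating idea described in the abstract (communicate with an observable agent only when doing so raises the average Q-value) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `q_gain` and `select_partners`, the `threshold` parameter, the message format, and the toy Q-function are all assumptions made for illustration, and the abstract does not specify how the gain is computed or thresholded.

```python
from statistics import mean
from typing import Callable, Dict, List, Optional, Sequence

# Hypothetical signature: maps (observation, optional message) to one Q-value per action.
QFunction = Callable[[Sequence[float], Optional[Sequence[float]]], Sequence[float]]


def q_gain(q_fn: QFunction, obs: Sequence[float], message: Sequence[float]) -> float:
    """Gain in the average Q-value obtained by conditioning on a neighbor's message."""
    return mean(q_fn(obs, message)) - mean(q_fn(obs, None))


def select_partners(
    q_fn: QFunction,
    obs: Sequence[float],
    neighbor_messages: Dict[str, Sequence[float]],
    threshold: float = 0.0,  # assumed cutoff; not taken from the paper
) -> List[str]:
    """Return ids of observable neighbors whose message raises the average Q-value above the threshold."""
    return [
        neighbor_id
        for neighbor_id, message in neighbor_messages.items()
        if q_gain(q_fn, obs, message) > threshold
    ]


def toy_q(obs: Sequence[float], message: Optional[Sequence[float]]) -> List[float]:
    """Toy two-action Q-function, used only to exercise the gate."""
    bonus = 0.0 if message is None else sum(message)
    base = sum(obs)
    return [base + bonus, base - 1.0 + bonus]


# Only neighbor "a" sends a message that increases the average Q-value.
partners = select_partners(toy_q, [1.0, 2.0], {"a": [0.5], "b": [-0.2], "c": [0.0]})
```

In a CTDE setting, a gate of this kind would be evaluated locally by each agent at execution time, so that messages are exchanged only with a subset of agents in the observable field rather than broadcast to all of them.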