DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning

Canzhe Zhao,Yanjie Ze,Jing Dong,Baoxiang Wang,Shuai Li

2023-08-19

Abstract:Communication lays the foundation for cooperation in human society and in multi-agent reinforcement learning (MARL). Humans also desire to maintain their privacy when communicating with others, yet such privacy concern has not been considered in existing works in MARL. To this end, we propose the \textit{differentially private multi-agent communication} (DPMAC) algorithm, which protects the sensitive information of individual agents by equipping each agent with a local message sender with rigorous $(\epsilon, \delta)$-differential privacy (DP) guarantee. In contrast to directly perturbing the messages with predefined DP noise as commonly done in privacy-preserving scenarios, we adopt a stochastic message sender for each agent respectively and incorporate the DP requirement into the sender, which automatically adjusts the learned message distribution to alleviate the instability caused by DP noise. Further, we prove the existence of a Nash equilibrium in cooperative MARL with privacy-preserving communication, which suggests that this problem is game-theoretically learnable. Extensive experiments demonstrate a clear advantage of DPMAC over baseline methods in privacy-preserving scenarios.

Machine Learning

What problem does this paper attempt to address?

The paper attempts to address the problem of achieving communication in multi-agent reinforcement learning (MARL) while protecting the privacy of individual agents. Specifically, although existing MARL research has demonstrated the importance of communication in promoting agent cooperation, these methods often overlook the potential privacy leakage issues during communication. For example, in autonomous driving systems, each car acts as an agent, and they need to communicate to improve the overall safety and efficiency of the system, but at the same time, they need to protect the personal privacy information of the car owners from being leaked. To solve this problem, the authors propose an algorithm called "Differential Privacy Multi-Agent Communication" (DPMAC). This algorithm equips each agent with a local message sender and ensures that its communication process meets the requirements of differential privacy (DP). Unlike the method of directly adding noise to the messages, DPMAC employs a random message sender, which mitigates the instability caused by adding DP noise by adjusting the learned message distribution. Additionally, the paper proves the existence of a Nash equilibrium in collaborative MARL with privacy-preserving communication, indicating that such problems are learnable. Through extensive experimental validation, DPMAC significantly outperforms baseline methods in privacy-preserving scenarios and achieves competitive performance even without privacy constraints. Overall, the main contribution of this paper is the first proposal of a solution for achieving private communication within the MARL framework and demonstrating that this solution is theoretically feasible.

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning

MARNet: Backdoor Attacks Against Cooperative Multi-Agent Reinforcement Learning

PMAC: Personalized Multi-Agent Communication

Privacy-Engineered Value Decomposition Networks for Cooperative Multi-Agent Reinforcement Learning

PP-MARL: Efficient Privacy-Preserving MARL for Cooperative Intelligence in Communication

Hiding in Plain Sight: Differential Privacy Noise Exploitation for Evasion-resilient Localized Poisoning Attacks in Multiagent Reinforcement Learning

Differentially Private Average Consensus of Multi-Agent Systems via Privacy-Based Dynamic Self-Triggered Mechanism

Privacy Preserving Multi-Agent Reinforcement Learning in Supply Chains

Generalizing Differentially Private Decentralized Deep Learning with Multi-Agent Consensus

Learning Effective Communication for Cooperative Pursuit with Multi-Agent Reinforcement Learning

Differentially Private Reinforcement Learning with Self-Play

Group Decision-Making among Privacy-Aware Agents

DCMAC: Demand-aware Customized Multi-Agent Communication via Upper Bound Training

DPMLBench: Holistic Evaluation of Differentially Private Machine Learning

Robust Communicative Multi-Agent Reinforcement Learning with Active Defense

R-MADDPG for Partially Observable Environments and Limited Communication

Depthwise Convolution for Multi-Agent Communication With Enhanced Mean-Field Approximation

Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method

Multi-Agent Reinforcement Learning With Privacy Preservation for Continuous Double Auction-Based P2P Energy Trading

Partial Communication Model Based on the Gain of Q-value in Multi-agent Reinforcement Learning

Effective Communications: A Joint Learning and Communication Framework for Multi-Agent Reinforcement Learning over Noisy Channels