Enhancing the robustness of QMIX against state-adversarial attacks

Weiran Guo,Guanjun Liu,Ziyuan Zhou,Ling Wang,Jiacun Wang
DOI: https://doi.org/10.1016/j.neucom.2023.127191
IF: 6
2024-01-07
Neurocomputing
Abstract:Multi-Agent Reinforcement Learning (MARL) trains the decision models of cooperative agents by making them gain the highest rewards. The Centralized Training with Decentralized Execution approach (CTDE) can effectively address some challenging issues faced by MARL, including convergence, stability, and scalability, but it cannot handle the robustness issue. However, the performance of a model trained by MARL can be seriously impacted by state-adversarial attacks that are viewed as the perturbations applied to an agent's observation. Most recent research has concentrated on robust Single-Agent Reinforcement Learning (SARL) against state-adversarial attacks. However, there has not yet been too much work on robust MARL. QMIX is one of the popular cooperative MARL algorithms based on CTDE, but there is no study about its robustness. This work shows that QMIX is also sensitive to state-adversarial attacks. Inspired by four existing techniques of enhancing the robustness of SARL, we propose four methods to enhance the robustness of QMIX against five types of attacks. Our experiments illustrate the strengths and weaknesses of these methods against the five attacks, and an in-depth analysis is provided as well.
computer science, artificial intelligence
What problem does this paper attempt to address?