Abstract:Recent works have revealed that backdoor attacks against Deep Reinforcement Learning (DRL) could lead to abnormal action selections of the agent, which may result in failure or even catastrophe in crucial decision processes. However, existing attacks only consider single-agent reinforcement learning (RL) systems, in which the only agent can observe the global state and have full control of the decision process. In this article, we explore a new backdoor attack paradigm in cooperative multi-agent reinforcement learning (CMARL) scenarios, where a group of agents coordinate with each other to achieve a common goal, while each agent can only observe the local state. In the proposed MARNet attack framework, we carefully design a pipeline of trigger design, action poisoning, and reward hacking modules to accommodate the cooperative multi-agent settings. In particular, as only a subset of agents can observe the triggers in their local observations, we maneuver their actions to the worst actions suggested by an expert policy model. Since the global reward in CMARL is aggregated by individual rewards from all agents, we propose to modify the reward in a way that boosts the bad actions of poisoned agents (agents who observe the triggers) but mitigates the influence on non-poisoned agents. We conduct extensive experiments on three classical CMARL algorithms VDN, COMA, and QMIX, in two popular CMARL games Predator Prey and SMAC. The results show that the baselines extended from single-agent DRL backdoor attacks seldom work in CMARL problems while MARNet performs well by reducing the utility under attack by nearly 100%. We apply fine-tuning as a potential defense against MARNet and demonstrate that fine-tuning cannot entirely eliminate the effect of the attack.

Multiple-Model Based Defense for Deep Reinforcement Learning Against Adversarial Attack

MARNet: Backdoor Attacks Against Cooperative Multi-Agent Reinforcement Learning

Towards Secure Multi-Agent Deep Reinforcement Learning: Adversarial Attacks and Countermeasures

Robust Multi-Agent Reinforcement Learning against Adversaries on Observation

Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations

Deep-Attack over the Deep Reinforcement Learning

Multi-Agent Guided Deep Reinforcement Learning Approach Against State Perturbed Adversarial Attacks

Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning

Evading Machine Learning Botnet Detection Models via Deep Reinforcement Learning

Robustifying Reinforcement Learning Agents via Action Space Adversarial Training

Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space

Selective Real‐time Adversarial Perturbations Against Deep Reinforcement Learning Agents

Curiosity-Driven and Victim-Aware Adversarial Policies.

Adversarial Deep Reinforcement Learning for Cyber Security in Software Defined Networks

Robust Deep Reinforcement Learning with Adversarial Attacks

Optimal Attack and Defense for Reinforcement Learning

Adversarial robustness of deep reinforcement learning-based intrusion detection

Characterizing Attacks on Deep Reinforcement Learning

Enhanced adversarial strategically-timed attacks against deep reinforcement learning

Camouflage Adversarial Attacks on Multiple Agent Systems

Deep Reinforcement Learning for Cyber System Defense under Dynamic Adversarial Uncertainties