PGN: A perturbation generation network against deep reinforcement learning

Xiangjuan Li,Feifan Li,Yang Li,Quan Pan
2023-12-20
Abstract:Deep reinforcement learning has advanced greatly and applied in many areas. In this paper, we explore the vulnerability of deep reinforcement learning by proposing a novel generative model for creating effective adversarial examples to attack the agent. Our proposed model can achieve both targeted attacks and untargeted attacks. Considering the specificity of deep reinforcement learning, we propose the action consistency ratio as a measure of stealthiness, and a new measurement index of effectiveness and stealthiness. Experiment results show that our method can ensure the effectiveness and stealthiness of attack compared with other algorithms. Moreover, our methods are considerably faster and thus can achieve rapid and efficient verification of the vulnerability of deep reinforcement learning.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to study how to quickly and efficiently verify the vulnerabilities of Deep Reinforcement Learning (DRL) systems. Specifically, the authors propose a novel generative model called the Perturbation Generative Network (PGN) to create effective adversarial examples to attack DRL systems. Compared to traditional adversarial attack methods, PGN has the following advantages: 1. **Fast Generation of Adversarial Examples**: PGN can quickly generate adversarial examples, enabling real-time attacks. 2. **Stealthiness and Effectiveness**: The adversarial examples generated by PGN can not only effectively mislead DRL systems but also maintain a high level of stealthiness, making the attacks less detectable. 3. **Targeted and Non-Targeted Attacks**: PGN can achieve both targeted attacks on specific goals and non-targeted attacks. Additionally, the authors propose a new evaluation metric called the Action Consistency Ratio (ACR) to measure the stealthiness of the attacks. They combine reward, ACR, and PSNR to propose a comprehensive evaluation index called AR to fully assess the effectiveness and stealthiness of the attacks. Experimental results show that compared to traditional adversarial attack methods (such as FGSM, CW, and PGD), PGN ensures the effectiveness of the attacks while achieving higher stealthiness and lower time complexity.