Online Robustness Training for Deep Reinforcement Learning

Marc Fischer, Matthew Mirman, Steven Stalder, Martin Vechev
DOI: https://doi.org/10.48550/arXiv.1911.00887
2019-11-22
Abstract: In deep reinforcement learning (RL), adversarial attacks can trick an agent into unwanted states and disrupt training. We propose a system called Robust Student-DQN (RS-DQN), which permits online robustness training alongside Q networks, while preserving competitive performance. We show that RS-DQN can be combined with (i) state-of-the-art adversarial training and (ii) provably robust training to obtain an agent that is resilient to strong attacks during training and evaluation.
Machine Learning