A defense method based on attention mechanism against traffic sign adversarial samples

Hailiang Li,Bin Zhang,Yu Zhang,Xilin Dang,Yuwei Han,Linfeng Wei,Yijun Mao,Jian Weng
DOI: https://doi.org/10.1016/j.inffus.2021.05.005
IF: 18.6
2021-12-01
Information Fusion
Abstract:<p>A traditional neural network cannot realize the invariance of image rotation and distortion well, so an attacker can fool the neural network by adding tiny disturbances to an image. If traffic signs are attacked, automatic driving will probably be misguided, leading to disastrous consequences. Inspired by the principle of human vision, this paper proposes a defense method based on an attentional mechanism for traffic sign adversarial samples. In this method, the affine coordinate parameters of the target objects in the images are extracted by a CNN, and then the target objects are redrawn by the coordinate mapping model. In this process, the key areas in the image are extracted by the attention mechanism, and the pixels are filtered by interpolation. Our model simulates the daily behavior of human beings, making it more intelligent in the defense against the adversarial samples. Experiments show that our method has a strong defense ability for traffic sign adversarial samples generated by various attack methods. Compared with other defense methods, our method is more universal and has a strong defense ability against a variety of attacks. Moreover, our model is portable and can be easily implanted into neural networks in the form of defense plug-ins.</p>
computer science, artificial intelligence, theory & methods
What problem does this paper attempt to address?
The paper primarily aims to address the vulnerability of deep neural networks (DNNs) when faced with adversarial examples, particularly the security challenges in the critical application of traffic sign recognition. The paper points out that traditional neural networks struggle to achieve invariance to image rotation and deformation, allowing attackers to mislead the neural network by adding small perturbations to the input image. If such an attack targets traffic signs, it could lead to misjudgments by autonomous driving systems, resulting in severe consequences. To solve this problem, the authors propose a defense method based on the attention mechanism to enhance the defense capability against adversarial examples of traffic signs. Specifically, this method is implemented through the following steps: 1. **Extract coordinate parameters**: Use convolutional neural networks (CNN) to extract affine coordinate parameters of the target object in the image. 2. **Redraw the target object**: Use a coordinate mapping model to redraw the target object based on the extracted coordinate parameters. 3. **Extract key areas**: During this process, use the attention mechanism to extract key areas of the image and filter pixels through interpolation to eliminate interference factors in the adversarial examples. This method simulates the characteristics of human visual behavior, enabling the model to more intelligently defend against adversarial examples. Experimental results show that this method has strong defense capabilities against adversarial examples of traffic signs generated by various attack methods. Compared with other existing defense methods, this method is more general and robust, and the model is highly portable, making it easy to embed as a defense plugin into neural networks. In short, the main contributions of the paper include: 1. Proposing a defense method based on the attention mechanism, enabling neural networks to focus on the key parts of the target object in the image while ignoring interference factors. 2. Designing a spatial transformation model that can transform important parts of the image into a new space and redraw them, generating new images. 3. Experimental results confirm that, compared to other defense models, the proposed model performs consistently across multiple neural networks and datasets, demonstrating better robustness and generality.