Generate Adversarial Examples by Spatially Perturbing on the Meaningful Area

Ting Deng,Zhigang Zeng
DOI: https://doi.org/10.1016/j.patrec.2019.06.028
IF: 4.757
2019-01-01
Pattern Recognition Letters
Abstract:Recently, research on adversarial attack and adversarial defense received more and more attention. Because although deep neural networks (DNNs) outperform humans in many tasks, studies have shown that they are easily fooled by human invisible disturbances. Identifying vulnerabilities in existing networks and further improving the robustness of the network is critical to the application of DNNs in real life. In this paper, a spatial transformed attack method based on the attention mechanism is proposed. The attention mechanism based on gradient-weighted class activation mapping (Grad-CAM) is used to find an meaningful attack area and the spatial transformation is performed on the this area to achieve adversarial attack. The experimental results show that the obtained method can reduce the magnitude of the disturbances needed to achieve the attack while ensuring the effectiveness. The adversarial examples generated by the obtained method shows smaller difference from the original sample under several different metrics. (C) 2019 Elsevier B.V. All rights reserved.
What problem does this paper attempt to address?