Defense Against Adversarial Attacks via Adversarial Noise Denoising Networks in Image Recognition

Chengxuan Li,Zhou Yang,Yang Xiao,Haozhao Liu,Yuning Zhang,Qingqi Pei
DOI: https://doi.org/10.1109/NaNA60121.2023.00092
2023-01-01
Abstract:Deep learning-based image recognition technology has significantly advanced the development of modern industrial intelligence. However, the issue of image adversarial examples that follows has gradually garnered the attention of researchers. By injecting adversarial disturbances that are difficult for humans to detect into the image, the deep learning model generates incorrect results, severely impacting the security of deep learning applications. To address this problem and improve the accuracy and robustness of deep learning image recognition models, a combined adversarial examples detection method based on residual learning and difference assessment has been proposed. To start, an image denoising module, based on residual learning and named ANDNet, is designed. This method incorporates a depth separable convolution structure and conducts step processing on the channel number of the hidden layer in ANDNet, which significantly reduces the model's memory consumption and improves its computational efficiency. In addition, the difference assessment involves computing the l <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</inf> norm distance between the original image and the denoised image softmax output of the classification model, to measure the disparity between the input image before and after denoising. The obtained detection threshold from the training dataset is integrated with this difference evaluation to accomplish the task of testing the input image for adversarial detection. Both theoretical analysis and experimental results confirm the effectiveness of the ANDNet noise reduction network, as well as the efficacy of the adversarial examples detection scheme in identifying adversarial examples. The model exhibits exceptional performance in terms of detection accuracy and F1 value.
What problem does this paper attempt to address?