Abstract:The increasing use of deep neural networks exposes themselves to adversarial attacks in the real world drawn from closed-set and open-set, which poses great threats to their application in safety-critical sys-tems. Since adversarial attacks tend to mislead an original model by adding small perturbations into clean images, an intuitive idea of defensing adversarial attacks is eliminating perturbations as much as possi-ble to mitigate attacking effects. However, such elimination-based strategies unfortunately fail to achieve satisfactory robustness. Aiming to investigate the intrinsic reasons for this phenomenon, systematic ex-periments are carried out in this paper to indicate that even a 20% residual perturbation can still preserve and exhibit attacking effects as strong as a full one. Our study also indicates that there are strong cor-relations between perturbations and legitimate images. Thus, breaking the correlation across multiple bands is more effective in mitigating attacking effects. Based on these findings, this paper proposes an efficient defense strategy called "Frequency-Adaptive Compression and rEconstruction (FACE)" to improve the robustness of the model to adversarial attacks. Specifically, low-frequency bands containing semantic information are compressed by a down-sampling operation, while the channel width of high-frequency bands is squeezed and further compressed by adding noise before the Tanh activating function. Mean-while, attachment spaces of perturbations are also squeezed to the extent as much as possible. Finally, a clean output is obtained by upsampling together with expanded reconstruction. Experiments are ex-tensively conducted on widely used datasets to demonstrate the effectiveness of the proposed method. For closed-set attacks, FACE outperforms the STOA elimination-based methods on ImageNet, achieving a 27.9% improvement. For the MNIST open-set attacks, it not only reduces the success rate of targeted at-tack by a large margin (from 100% to 24.7%), but also mitigates attacking effects with an FPR-95 value of 0.3.(c) 2023 Elsevier Ltd. All rights reserved.

Improving Adversarial Robustness Against Universal Patch Attacks Through Feature Norm Suppressing

Defending Against Universal Adversarial Patches by Clipping Feature Norms

An Adversarial Attack Via Feature Contributive Regions

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

PatchGuard: A Provably Robust Defense against Adversarial Patches via Small Receptive Fields and Masking

Improving the Robustness of Deep Convolutional Neural Networks Through Feature Learning

Improving Adversarial Robustness via Feature Pattern Consistency Constraint

I Don't Know You, But I Can Catch You: Real-Time Defense against Diverse Adversarial Patches for Object Detectors

Simultaneously Optimizing Perturbations and Positions for Black-box Adversarial Patch Attacks.

PatchGuard++: Efficient Provable Attack Detection against Adversarial Patches

Using Frequency Attention to Make Adversarial Patch Powerful Against Person Detector

Investigating and unmasking feature-level vulnerabilities of CNNs to adversarial perturbations

PatchZero: Defending against Adversarial Patch Attacks by Detecting and Zeroing the Patch

Defense Against Adversarial Attacks with Efficient Frequency-Adaptive Compression and Reconstruction

Improving Adversarial Robustness of Detector Via Objectness Regularization

Defending Adversarial Patches via Joint Region Localizing and Inpainting

Certified Defense Against Patch Attacks Via Mask-Guided Randomized Smoothing

Adaptive Feature Alignment for Adversarial Training

Enhancing Intrinsic Adversarial Robustness via Feature Pyramid Decoder

Improving Adversarial Robustness via Decoupled Visual Representation Masking

Adversarial Feature Augmentation and Normalization for Visual Recognition