Abstract:Convolutional neural networks (CNNs) have been extensively used in numerous remote sensing image detection tasks owing to their exceptional performance. Nevertheless, CNNs are often vulnerable to adversarial examples, limiting the uses in different safety-critical scenarios. Recently, how to efficiently detect adversarial examples and improve the robustness of CNNs has drawn considerable focus. The existing adversarial example detection methods require modifying CNNs, which not only affects the model performance but also greatly enhances training cost. With the purpose of solving these problems, this study proposes a detection algorithm for adversarial examples that does not need modification of the CNN models and can simultaneously retain the classification accuracy of normal examples. Specifically, we design a method to detect adversarial examples using frequency domain reconstruction. After converting the input adversarial examples into the frequency domain by Fourier transform, the adversarial disturbance from adversarial attacks can be eliminated by modifying the frequency of the example. The inverse Fourier transform is then used to maximize the recovery of the original example. Firstly, we train a CNN to reconstruct input examples. Then, we insert Fourier transform, convolution operation, and inverse Fourier transform into the features of the input examples to automatically filter out adversarial frequencies. We refer to our proposed method as FDR (frequency domain reconstruction), which removes adversarial interference by converting input samples into frequency and reconstructing them back into the spatial domain to restore the image. In addition, we also introduce gradient masking into the proposed FDR method to enhance the detection accuracy of the model for complex adversarial examples. We conduct extensive experiments on five mainstream adversarial attacks on three benchmark datasets, and the experimental results show that FDR can outperform state-of-the-art solutions in detecting adversarial examples. Additionally, FDR does not require any modifications to the detector and can be integrated with other adversarial example detection methods to be installed in sensing devices to ensure detection safety.

Feature Fusion Based Adversarial Example Detection Against Second-Round Adversarial Attacks

An Adversarial Attack Via Feature Contributive Regions

Adversarial Examples Detection of Radio Signals Based on Multifeature Fusion.

Detection Based Defense Against Adversarial Examples from the Steganalysis Point of View

D2Defend: Dual-Domain based Defense against Adversarial Examples

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

Adversarial Examples Detection with Enhanced Image Difference Features based on Local Histogram Equalization

Adaptive Feature Alignment for Adversarial Training

Improving Adversarial Robustness via Feature Pattern Consistency Constraint

Enhancing Intrinsic Adversarial Robustness via Feature Pyramid Decoder

Adversarial Examples Detection Beyond Image Space.

CSFAdv: Critical Semantic Fusion Guided Least-Effort Adversarial Example Attacks

Fusion is Not Enough: Single Modal Attacks on Fusion Models for 3D Object Detection

Feature decoupling and interaction network for defending against adversarial examples

Hessian-Free Second-Order Adversarial Examples for Adversarial Learning

Playing Against Deep-Neural-Network-Based Object Detectors: A Novel Bidirectional Adversarial Attack Approach

A Novel Adversarial Example Detection Method Based on Frequency Domain Reconstruction for Image Sensors

Adversarial example defense based on image reconstruction

Improving Adversarial Robustness Against Universal Patch Attacks Through Feature Norm Suppressing

Detecting Adversarial Examples by Additional Evidence from Noise Domain

LFAA: Crafting Transferable Targeted Adversarial Examples with Low-Frequency Perturbations