Abstract:Convolutional neural networks (CNNs) have been extensively used in numerous remote sensing image detection tasks owing to their exceptional performance. Nevertheless, CNNs are often vulnerable to adversarial examples, limiting the uses in different safety-critical scenarios. Recently, how to efficiently detect adversarial examples and improve the robustness of CNNs has drawn considerable focus. The existing adversarial example detection methods require modifying CNNs, which not only affects the model performance but also greatly enhances training cost. With the purpose of solving these problems, this study proposes a detection algorithm for adversarial examples that does not need modification of the CNN models and can simultaneously retain the classification accuracy of normal examples. Specifically, we design a method to detect adversarial examples using frequency domain reconstruction. After converting the input adversarial examples into the frequency domain by Fourier transform, the adversarial disturbance from adversarial attacks can be eliminated by modifying the frequency of the example. The inverse Fourier transform is then used to maximize the recovery of the original example. Firstly, we train a CNN to reconstruct input examples. Then, we insert Fourier transform, convolution operation, and inverse Fourier transform into the features of the input examples to automatically filter out adversarial frequencies. We refer to our proposed method as FDR (frequency domain reconstruction), which removes adversarial interference by converting input samples into frequency and reconstructing them back into the spatial domain to restore the image. In addition, we also introduce gradient masking into the proposed FDR method to enhance the detection accuracy of the model for complex adversarial examples. We conduct extensive experiments on five mainstream adversarial attacks on three benchmark datasets, and the experimental results show that FDR can outperform state-of-the-art solutions in detecting adversarial examples. Additionally, FDR does not require any modifications to the detector and can be integrated with other adversarial example detection methods to be installed in sensing devices to ensure detection safety.

A data‐driven adversarial examples recognition framework via adversarial feature genomes

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

Attack As Detection: Using Adversarial Attack Methods to Detect Abnormal Examples.

An Adversarial Attack Via Feature Contributive Regions

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

Adversarial Examples Detection with Enhanced Image Difference Features based on Local Histogram Equalization

Improving the Robustness of Deep Convolutional Neural Networks Through Feature Learning

FCGSM: Fast Conjugate Gradient Sign Method for Adversarial Attack on Image Classification

A Framework for Robust Deep Learning Models Against Adversarial Attacks Based on a Protection Layer Approach

Model-agnostic Adversarial Example Detection via High-Frequency Amplification

Adaptive Feature Alignment for Adversarial Training

An efficient adversarial example generation algorithm based on an accelerated gradient iterative fast gradient

A novel and universal GAN-based countermeasure to recover adversarial examples to benign examples

Class-aware domain adaptation for improving adversarial robustness

GCSA: A New Adversarial Example-Generating Scheme Towards Black-Box Adversarial Attacks

Attention‐guided transformation‐invariant attack for black‐box adversarial examples

Adversarial Examples Detection Through the Sensitivity in Space Mappings.

A General Framework for Adversarial Examples with Objectives

A Novel Adversarial Example Detection Method Based on Frequency Domain Reconstruction for Image Sensors

Adversarial Example Games

Adversarial Examples Detection of Radio Signals Based on Multifeature Fusion.