Feature Fusion Based Adversarial Example Detection Against Second-Round Adversarial Attacks

Chuan Qin,Yuefeng Chen,Kejiang Chen,Xiaoyi Dong,Weiming Zhang,Xiaofeng Mao,Yuan He,Nenghai Yu
DOI: https://doi.org/10.1109/tai.2022.3190816
2022-01-01
IEEE Transactions on Artificial Intelligence
Abstract:Convolutional neural networks (CNNs) achieve remarkable performances in various areas. However, adversarial examples threaten their security. They are designed to mislead CNNs to output incorrect results. Many methods are proposed to detect adversarial examples. Unfortunately, most detection-based defense methods are vulnerable to second-round adversarial attacks, which can simultaneously deceive the base model and the detector. To resist such second-round adversarial attacks, handcrafted steganalysis features are introduced to detect adversarial examples, while such a method receives low accuracy at detecting sparse perturbations. In this article, we propose to combine handcrafted features with deep features via a fusion scheme to increase the detection accuracy and defend against second-round adversarial attacks. To avoid deep features being overwhelmed by high-dimensional handcrafted features, we propose an expansion-then-reduction process to compress the dimensionality of handcrafted features. Experimental results show that the proposed model outperforms the state-of-the-art adversarial example detection methods and remains robust under various second-round adversarial attacks.
What problem does this paper attempt to address?