Few2Decide: Towards a Robust Model Via Using Few Neuron Connections to Decide.

Li Jian,Guo Yanming,Lao Songyang,Zhao Xiang,Bai Liang,Wang Haoran
DOI: https://doi.org/10.1007/s13735-021-00223-4
2022-01-01
International Journal of Multimedia Information Retrieval
Abstract:Researches have shown that image classification networks are vulnerable to adversarial examples, which seriously limits their application in safely critical scenarios. Existing defense methods usually employ adversarial training or adjust the network structure to resist adversarial attack. Although these defense methods can improve the model robustness to some extent, they often significantly decrease the accuracy on the clean data and bring additional computational cost. In this work, we analyze the impact of adversarial example on neuron connections and propose a Few2Decide method to train a robust model by dropping part of non-robust connections in the fully connected layer. Our model can get high perturbed data accuracy without increasing trainable parameters, meanwhile, get high clean data accuracy. Experimental results prove that our method can provide a robust model and achieve state-of-the-art performance on the CIFAR-10 dataset. Specifically, our Few2Decide method achieves 73.01% adversarial accuracy on the CIFAR-10 dataset under the challenging untargeted attack in white-box settings with an attack strength 8/255, using ResNet-20[4$$\times $$ × ] architecture.
What problem does this paper attempt to address?