Abstract: Deep convolutional neural network (DCNN) models are vulnerable to examples with small perturbations. Adversarial training (AT) is a widely used approach that enhances the robustness of DCNN models through data augmentation. In AT, DCNN models are trained on clean examples together with adversarial examples (AEs) generated by a specific attack method, with the aim of defending against unseen AEs. In practice, however, the trained models are often fooled by AEs generated by novel attack methods. This naturally raises a question: can a DCNN model learn features that are insensitive to small perturbations, and thereby defend itself regardless of the attack method used? To answer this question, this paper makes an initial effort by proposing a shallow binary feature module (SBFM), which can be integrated into any popular backbone. The SBFM includes two types of layers: a Sobel layer and a threshold layer. The Sobel layer contains four parallel feature maps that represent horizontal, vertical, and diagonal edge features. The threshold layer turns the edge features learnt by the Sobel layer into binary features, which are then fed into the fully connected layers for classification, together with the features learnt by the backbone. We integrate SBFM into VGG16 and ResNet34 and conduct experiments on multiple datasets. Experimental results demonstrate that, under FGSM attack with $\epsilon=8/255$, the SBFM-integrated models achieve on average 35\% higher accuracy than the original ones, and on the CIFAR-10 and TinyImageNet datasets, the SBFM-integrated models achieve on average 75\% classification accuracy. This work shows that enhancing the robustness of DCNN models through feature learning is promising.
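To make the Sobel-then-threshold idea concrete, the following is a minimal sketch in plain Python, not the authors' implementation. The abstract does not specify the filters, the threshold value, or how the response is binarized, so the four 3x3 kernels in `SOBEL_KERNELS`, the default `threshold=1.0`, the use of the absolute response, and all function names are assumptions made for illustration. The perturbation is a fixed checkerboard sign pattern of magnitude $8/255$, used as a simple stand-in for a bounded perturbation; it is not FGSM, which would require model gradients.

```python
EPS = 8 / 255  # perturbation budget quoted in the abstract

# Hypothetical 3x3 Sobel-style kernels (assumed; the abstract gives no filter values).
SOBEL_KERNELS = {
    "horizontal": [[-1, -2, -1], [0, 0, 0], [1, 2, 1]],
    "vertical": [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]],
    "diagonal_45": [[0, 1, 2], [-1, 0, 1], [-2, -1, 0]],
    "diagonal_135": [[-2, -1, 0], [-1, 0, 1], [0, 1, 2]],
}


def correlate_valid(image, kernel):
    """Slide a 3x3 kernel over a 2D list without padding; return the response map."""
    rows, cols = len(image), len(image[0])
    return [
        [
            sum(kernel[i][j] * image[r + i][c + j] for i in range(3) for j in range(3))
            for c in range(cols - 2)
        ]
        for r in range(rows - 2)
    ]


def sobel_layer(image):
    """Produce four parallel edge maps, one per assumed kernel."""
    return {name: correlate_valid(image, k) for name, k in SOBEL_KERNELS.items()}


def threshold_layer(feature_maps, threshold=1.0):
    """Binarize each map: 1 where |response| >= threshold, else 0 (assumed rule)."""
    return {
        name: [[1 if abs(v) >= threshold else 0 for v in row] for row in fmap]
        for name, fmap in feature_maps.items()
    }


def sbfm_features(image, threshold=1.0):
    """Sobel layer followed by threshold layer."""
    return threshold_layer(sobel_layer(image), threshold)


def perturb(image, eps):
    """Add +eps or -eps in a checkerboard pattern, clipped to [0, 1]."""
    return [
        [min(1.0, max(0.0, v + (eps if (r + c) % 2 == 0 else -eps))) for c, v in enumerate(row)]
        for r, row in enumerate(image)
    ]


# Toy 5x5 image with a single vertical step edge.
clean = [[0.0, 0.0, 1.0, 1.0, 1.0] for _ in range(5)]
perturbed = perturb(clean, EPS)

clean_bits = sbfm_features(clean)
perturbed_bits = sbfm_features(perturbed)

print(clean_bits["vertical"])        # [[1, 1, 0], [1, 1, 0], [1, 1, 0]]
print(clean_bits == perturbed_bits)  # True
```

On this toy image the binary maps are unchanged because each kernel's absolute weights sum to 8, so a perturbation bounded by $8/255$ shifts any response by at most about 0.25, while the clean responses (0, 3, and 4) all sit well away from the assumed threshold of 1. Responses close to the threshold could still flip, so this illustrates the intuition only and says nothing about the accuracy figures reported above.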

Explicitly Modeling Pre-Cortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness

Matching the Neuronal Representations of V1 is Necessary to Improve Robustness in CNNs with V1-like Front-ends

A precortical module for robust CNNs to light variations

Brain inspired Robust Vision using Convolutional Neural Networks with Feedback

Convolutional neural networks for vision neuroscience: significance, developments, and outstanding issues

Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks

Explaining V1 Properties with a Biologically Constrained Deep Learning Architecture

Learning From Brains How to Regularize Machines

Retinotopic Mapping Enhances the Robustness of Convolutional Neural Networks

Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial Noises

A Visual Cortex-Attentive Deep Convolutional Neural Network for Digital Image Design

Leveraging the Human Ventral Visual Stream to Improve Neural Network Robustness

Convolutional Neural Networks Exploiting Attributes of Biological Neurons

Hierarchical Spiking-Based Model for Efficient Image Classification with Enhanced Feature Extraction and Encoding

Receptive Field Refinement for Convolutional Neural Networks Reliably Improves Predictive Performance

Improving Model Robustness Against Adversarial Examples with Redundant Fully Connected Layer

Improving the Robustness of Deep Convolutional Neural Networks Through Feature Learning

Bio-inspired Robustness: A Review

Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance

Visual Analytics of Neuron Vulnerability to Adversarial Attacks on Convolutional Neural Networks