LayerOut: Freezing Layers in Deep Neural Networks

Kelam Goutam, S. Balasubramanian, Darshan Gera, R. Raghunatha Sarma
DOI: https://doi.org/10.1007/s42979-020-00312-x
2020-09-01
SN Computer Science
Abstract: Deep networks involve a huge amount of computation during the training phase and are prone to over-fitting. To ameliorate these problems, several conventional techniques such as DropOut, DropConnect, Guided Dropout, Stochastic Depth, and BlockDrop have been proposed. These techniques regularize a neural network by dropping nodes, connections, layers, or blocks within the network. However, these conventional regularization techniques suffer from the limitation that they are suited either to fully connected networks or to ResNet-based architectures. In this research, we propose a novel regularization technique, LayerOut, for training deep neural networks, which stochastically freezes the trainable parameters of a layer during an epoch of training. This technique can be applied to both fully connected networks and all types of convolutional networks such as VGG-16, ResNet, etc. Experimental evaluation on multiple datasets, including MNIST, CIFAR-10, and CIFAR-100, demonstrates that LayerOut generalizes better than the conventional regularization techniques and additionally reduces the computational burden significantly. We have observed up to 70% reduction in computation per epoch and up to 2% improvement in classification accuracy compared to the baseline networks (VGG-16 and ResNet-110) on the above datasets. Code is publicly available at https://github.com/Goutam-Kelam/LayerOut.
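The core idea stated in the abstract, stochastically freezing the trainable parameters of a layer for the duration of an epoch, can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that). The independent per-layer freezing probability `freeze_prob`, the helper names `sample_frozen_layers` and `trainable_fraction`, and the layer names and parameter counts are all assumptions made for illustration; the abstract does not specify the sampling rule.

```python
import random


def sample_frozen_layers(layer_names, freeze_prob, rng):
    """Pick the layers to freeze for one epoch.

    Assumption: each layer is frozen independently with probability
    `freeze_prob`. The abstract only says the choice is stochastic.
    """
    return {name for name in layer_names if rng.random() < freeze_prob}


def trainable_fraction(param_counts, frozen):
    """Fraction of all parameters that still receive updates this epoch."""
    total = sum(param_counts.values())
    active = sum(n for name, n in param_counts.items() if name not in frozen)
    return active / total


# Hypothetical per-layer parameter counts for a small network.
param_counts = {"conv1": 1_792, "conv2": 36_928, "fc1": 262_656, "fc2": 5_130}

rng = random.Random(0)  # fixed seed so the schedule is reproducible
schedule = []
for epoch in range(5):
    # Re-sample the frozen set at the start of every epoch.
    frozen = sample_frozen_layers(param_counts, freeze_prob=0.5, rng=rng)
    # A real training loop would skip parameter updates for the layers in
    # `frozen` during this epoch; here we only record the schedule.
    schedule.append((epoch, sorted(frozen), trainable_fraction(param_counts, frozen)))

for epoch, frozen, fraction in schedule:
    print(f"epoch {epoch}: frozen={frozen} trainable fraction={fraction:.2f}")
```

The trainable fraction is only a rough proxy for the update cost saved in an epoch. The reductions in computation reported in the abstract are measured by the authors and are not reproduced by this sketch.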