Pruning Convolutional Neural Networks via Stochastic Gradient Hard Thresholding

Xin Yang,Haiwei Lu,Hui Shuai,Xiao-Tong Yuan
DOI: https://doi.org/10.1007/978-3-030-31654-9_32
2019-01-01
Abstract:AbstractIn this paper, we introduce an iterative filter pruning method to compress deep convolutional neural networks. Different from many existing network compression methods which fall into the training-pruning-fine-tuning framework, we propose a new hybrid stochastic gradient hard thresholding algorithm (AHSG-HT) for adaptive structured pruning. Our approach trains the original network from scratch and continually prunes the “unimportant” filters by setting their values to zero in order to let algorithm automatically select filters. After training, we adopt physically pruning method to discard the filters with zero weights. Finally, we reconstruct the network and obtain a compact network without fine-tuning which also has a comparable performance. We conduct several experiments using ResNet on CIFAR-10 and CIFAR-100. Our approach achieves 92.40$$\%$$ accuracy on ResNet-56 with 63$$\%$$ reduction in parameters and FLOPs. The accuracy of CIFAR-10 on Resnet-110 with 20$$\%$$ pruning rate can reach up to 94.09$$\%$$, which exceeds the baseline accuracy of the full network without pruning.
What problem does this paper attempt to address?