Local Pruning Global Pruned Network under Knowledge Distillation

Hanyi Luo,Hao Tang,Kun Zhan
DOI: https://doi.org/10.1117/12.2626592
2021-01-01
Abstract:Network pruning has achieved great success in the compression and acceleration of neural networks on resource- limited devices. Previous pruning algorithms utilize filter pruning or channel pruning with the definition of a specific global or local pruning rate. Conventional pruning only finds or considers global or local pruning rates. In the only consideration of global pruning works, they ignore the individual characteristics of each layer. Similarly, only consideration of local pruning works could lead to a fragmented connection between layers. In this paper, we propose a novel method named global and local pruning under knowledge distillation (GLKD) by a combination of filter pruning and channel pruning technology, which is trained with a mixture of global and local pruning rates. The proposed algorithm, GLKD, can accelerate the inference of ResNet-110 to 56.2% speed-up with 0.17% accuracy increase on the CIFAR-100 dataset, which has great trade-offs in accuracy and compression. Additionally, the experiments of GLKD on ImageNet with ResNet-56 and ResNet-110 are conducted to prove its effectiveness on the compressed model. Moreover, the knowledge distillation is adopted on the pruning step in GLKD algorithm and improves the accuracy of the pruned network.
What problem does this paper attempt to address?