A Hybrid Kernel Pruning Approach for Efficient and Accurate CNNs.

Xiao Yi, Bo Wang, Shengbai Luo, Tiejun Li, Lizhou Wu, Jianmin Zhang, Kenli Li, Sheng Ma
DOI: https://doi.org/10.1007/978-981-97-0862-8_3
2024-01-01
Abstract: To reduce the overhead of neural network training and inference, several techniques have been widely used to prune neural network models. Pruning algorithms can significantly reduce the number of parameters in a model, which in turn reduces the amount of computation required during training and inference. Currently, the most popular approach is structured pruning, which prunes the model at the kernel level. Researchers usually use norm-based criteria to determine which kernels to prune. While this type of algorithm works well, it has some shortcomings. First, the effectiveness of norm-based pruning lacks support from mathematical theory. Second, norm-based pruning requires certain conditions to work well. To address these shortcomings, we propose a novel kernel pruning algorithm. Based on the observation that convolution kernels act as feature extractors, we design a pruning algorithm that uses functional similarity as the criterion for selecting kernels to prune. Our experimental results show that when pruning ResNet with a high pruning ratio, this algorithm can obtain a sparse model with high accuracy. Moreover, when combined with the norm-based pruning algorithm, our functional similarity-based pruning algorithm can produce a more accurate model than either algorithm alone, even at the same pruning ratio.
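
The two kinds of selection criteria mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not define the functional similarity measure or how the two criteria are combined, so the sketch assumes cosine similarity between flattened kernels as a stand-in, and the function names (`norm_based_prune`, `similarity_based_prune`) and toy kernels are hypothetical.

```python
import math


def l1_norm(kernel):
    """L1 norm of a flattened kernel (sum of absolute weights)."""
    return sum(abs(w) for w in kernel)


def cosine_similarity(a, b):
    """Cosine similarity of two flattened kernels; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def norm_based_prune(kernels, n_prune):
    """Norm-based criterion: select the n_prune kernels with the smallest L1 norm."""
    order = sorted(range(len(kernels)), key=lambda i: l1_norm(kernels[i]))
    return sorted(order[:n_prune])


def similarity_based_prune(kernels, n_prune):
    """Similarity criterion (assumed form): repeatedly find the most similar
    pair among the remaining kernels and select the later one for pruning,
    on the premise that two near-identical kernels extract the same feature."""
    remaining = list(range(len(kernels)))
    pruned = []
    while len(pruned) < n_prune and len(remaining) > 1:
        best = None  # (similarity, index_i, index_j)
        for pos, i in enumerate(remaining):
            for j in remaining[pos + 1:]:
                sim = cosine_similarity(kernels[i], kernels[j])
                if best is None or sim > best[0]:
                    best = (sim, i, j)
        _, _, redundant = best
        remaining.remove(redundant)
        pruned.append(redundant)
    return sorted(pruned)


# Toy layer with four flattened 2x2 kernels (illustrative values only).
kernels = [
    [1.0, 0.0, 0.0, 0.0],  # k0
    [2.0, 0.0, 0.0, 0.0],  # k1: same direction as k0, so redundant with it
    [0.0, 0.1, 0.0, 0.0],  # k2: small norm, but a distinct direction
    [0.0, 0.0, 3.0, 1.0],  # k3
]

print(norm_based_prune(kernels, 1))        # selects k2, the smallest-norm kernel
print(similarity_based_prune(kernels, 1))  # selects k1, the duplicate of k0
```

On this toy layer the two criteria disagree: the norm-based rule selects the small but distinct kernel k2, while the similarity rule selects k1 because it duplicates k0. That difference is one way to see why combining the criteria could help, although the combination scheme itself is not given in the abstract.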