Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks

Kaixin Xu,Zhe Wang,Xue Geng,Jie Lin,Min Wu,Xiaoli Li,Weisi Lin
2023-08-24
Abstract:In this paper, we propose a novel layer-adaptive weight-pruning approach for Deep Neural Networks (DNNs) that addresses the challenge of optimizing the output distortion minimization while adhering to a target pruning ratio constraint. Our approach takes into account the collective influence of all layers to design a layer-adaptive pruning scheme. We discover and utilize a very important additivity property of output distortion caused by pruning weights on multiple layers. This property enables us to formulate the pruning as a combinatorial optimization problem and efficiently solve it through dynamic programming. By decomposing the problem into sub-problems, we achieve linear time complexity, making our optimization algorithm fast and feasible to run on CPUs. Our extensive experiments demonstrate the superiority of our approach over existing methods on the ImageNet and CIFAR-10 datasets. On CIFAR-10, our method achieves remarkable improvements, outperforming others by up to 1.0% for ResNet-32, 0.5% for VGG-16, and 0.7% for DenseNet-121 in terms of top-1 accuracy. On ImageNet, we achieve up to 4.7% and 4.6% higher top-1 accuracy compared to other methods for VGG-16 and ResNet-50, respectively. These results highlight the effectiveness and practicality of our approach for enhancing DNN performance through layer-adaptive weight pruning. Code will be available on <a class="link-external link-https" href="https://github.com/Akimoto-Cris/RD_VIT_PRUNE" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper primarily addresses the following issues: 1. **Optimization Objective**: Proposes a new layer-adaptive weight pruning method to tackle the challenge of simultaneously minimizing output distortion and meeting the target pruning ratio constraint in deep neural networks (DNNs). 2. **Pruning Scheme**: Designs a layer-adaptive pruning scheme by considering the overall impact of all layers. It discovers and leverages the additive property of output distortion caused by multi-layer pruning, transforming the pruning problem into a combinatorial optimization problem, which is efficiently solved through dynamic programming. 3. **Algorithm Performance Improvement**: By decomposing the problem into subproblems, it achieves linear time complexity, enabling the optimization algorithm to run quickly on a CPU. Experimental results show that this method outperforms existing methods on the ImageNet and CIFAR-10 datasets. For example, on CIFAR-10, the top-1 accuracy for ResNet-32, VGG-16, and DenseNet-121 improved by 1.0%, 0.5%, and 0.7% respectively; on ImageNet, the top-1 accuracy for VGG-16 and ResNet-50 improved by 4.7% and 4.6% respectively. These results highlight the effectiveness and practicality of the method in improving DNN performance through layer-adaptive weight pruning.