Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks

Kaixin Xu,Zhe Wang,Xue Geng,Jie Lin,Min Wu,Xiaoli Li,Weisi Lin

2023-08-24

Abstract:In this paper, we propose a novel layer-adaptive weight-pruning approach for Deep Neural Networks (DNNs) that addresses the challenge of optimizing the output distortion minimization while adhering to a target pruning ratio constraint. Our approach takes into account the collective influence of all layers to design a layer-adaptive pruning scheme. We discover and utilize a very important additivity property of output distortion caused by pruning weights on multiple layers. This property enables us to formulate the pruning as a combinatorial optimization problem and efficiently solve it through dynamic programming. By decomposing the problem into sub-problems, we achieve linear time complexity, making our optimization algorithm fast and feasible to run on CPUs. Our extensive experiments demonstrate the superiority of our approach over existing methods on the ImageNet and CIFAR-10 datasets. On CIFAR-10, our method achieves remarkable improvements, outperforming others by up to 1.0% for ResNet-32, 0.5% for VGG-16, and 0.7% for DenseNet-121 in terms of top-1 accuracy. On ImageNet, we achieve up to 4.7% and 4.6% higher top-1 accuracy compared to other methods for VGG-16 and ResNet-50, respectively. These results highlight the effectiveness and practicality of our approach for enhancing DNN performance through layer-adaptive weight pruning. Code will be available on <a class="link-external link-https" href="https://github.com/Akimoto-Cris/RD_VIT_PRUNE" rel="external noopener nofollow">this https URL</a>.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

### Problems Addressed by the Paper The paper primarily addresses the following issues: 1. **Optimization Objective**: Proposes a new layer-adaptive weight pruning method to tackle the challenge of simultaneously minimizing output distortion and meeting the target pruning ratio constraint in deep neural networks (DNNs). 2. **Pruning Scheme**: Designs a layer-adaptive pruning scheme by considering the overall impact of all layers. It discovers and leverages the additive property of output distortion caused by multi-layer pruning, transforming the pruning problem into a combinatorial optimization problem, which is efficiently solved through dynamic programming. 3. **Algorithm Performance Improvement**: By decomposing the problem into subproblems, it achieves linear time complexity, enabling the optimization algorithm to run quickly on a CPU. Experimental results show that this method outperforms existing methods on the ImageNet and CIFAR-10 datasets. For example, on CIFAR-10, the top-1 accuracy for ResNet-32, VGG-16, and DenseNet-121 improved by 1.0%, 0.5%, and 0.7% respectively; on ImageNet, the top-1 accuracy for VGG-16 and ResNet-50 improved by 4.7% and 4.6% respectively. These results highlight the effectiveness and practicality of the method in improving DNN performance through layer-adaptive weight pruning.

Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks

Class-Aware Pruning for Efficient Neural Networks

Pruning by Training: A Novel Deep Neural Network Compression Framework for Image Processing.

A Systematic DNN Weight Pruning Framework Using Alternating Direction Method of Multipliers

Differential Evolution Based Layer-Wise Weight Pruning for Compressing Deep Neural Networks

AdaPruner: Adaptive Channel Pruning and Effective Weights Inheritance

Optimization Based Layer-Wise Pruning Threshold Method for Accelerating Convolutional Neural Networks

FVW: Finding Valuable Weight on Deep Neural Network for Model Pruning

Optimization based Layer-wise Magnitude-based Pruning for DNN Compression

ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning

Knapsack Pruning with Inner Distillation

Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM

A Unified Framework of DNN Weight Pruning and Weight Clustering/Quantization Using ADMM

A Dynamic Pruning Method on Multiple Sparse Structures in Deep Neural Networks

Deep Learning Architecture Improvement Based on Dynamic Pruning and Layer Fusion

Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon

Efficient DNN Neuron Pruning by Minimizing Layer-wise Nonlinear Reconstruction Error

Complex hybrid weighted pruning method for accelerating convolutional neural networks

A systematic DNN weight pruning framework based on symmetric accelerated stochastic ADMM

Layer Pruning for Accelerating Very Deep Neural Networks.

Post-training deep neural network pruning via layer-wise calibration