ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning

Xiaolong Ma,Geng Yuan,Sheng Lin,Zhengang Li,Hao Sun,Yanzhi Wang

DOI: https://doi.org/10.48550/arXiv.1905.00136

2019-05-01

Abstract:The state-of-art DNN structures involve high computation and great demand for memory storage which pose intensive challenge on DNN framework resources. To mitigate the challenges, weight pruning techniques has been studied. However, high accuracy solution for extreme structured pruning that combines different types of structured sparsity still waiting for unraveling due to the extremely reduced weights in DNN networks. In this paper, we propose a DNN framework which combines two different types of structured weight pruning (filter and column prune) by incorporating alternating direction method of multipliers (ADMM) algorithm for better prune performance. We are the first to find non-optimality of ADMM process and unused weights in a structured pruned model, and further design an optimization framework which contains the first proposed Network Purification and Unused Path Removal algorithms which are dedicated to post-processing an structured pruned model after ADMM steps. Some high lights shows we achieve 232x compression on LeNet-5, 60x compression on ResNet-18 CIFAR-10 and over 5x compression on AlexNet. We share our models at anonymous link <a class="link-external link-http" href="http://bit.ly/2VJ5ktv" rel="external noopener nofollow">this http URL</a>.

Machine Learning,Artificial Intelligence,Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to achieve efficient weight pruning in deep neural networks (DNN) to reduce computational and storage requirements while maintaining high accuracy. Specifically, the paper focuses on techniques that combine different types of structured sparsity (such as filter pruning and column pruning) in extreme structured pruning. These techniques are very effective in reducing the number of weights, but at the same time also bring the problem of accuracy loss. To solve these problems, the authors propose an optimization framework combined with the Alternating Direction Method of Multipliers (ADMM), as well as the Network Purification and Unused Path Removal (P - RM) algorithms for post - processing the model after ADMM - structured pruning, further improving the compression rate and maintaining network accuracy. The main contributions of the paper include: 1. Using the ADMM method to effectively optimize non - convex problems and successfully applying it to structured weight pruning. 2. Designing new Network Purification and Unused Path Removal (P - RM) algorithms, focusing on post - processing ADMM - structured pruning models to improve the compression rate while maintaining accuracy. Through these methods, the authors achieved significant model compression effects on multiple benchmark datasets. For example, a compression rate of 232 times was achieved on LeNet - 5, a compression rate of 60 times was achieved on ResNet - 18 (CIFAR - 10), and a compression rate of more than 5 times was also achieved on AlexNet, and none of these compressions led to significant accuracy degradation.

ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning

Class-Aware Pruning for Efficient Neural Networks

Loss Constrains Added Squeeze and Excitation Blocks for Pruning Deep Neural Networks

Structured Deep Neural Network Pruning by Varying Regularization Parameters.

Pruning by Training: A Novel Deep Neural Network Compression Framework for Image Processing.

Structured Probabilistic Pruning for Convolutional Neural Network Acceleration.

StructADMM: A Systematic, High-Efficiency Framework of Structured Weight Pruning for DNNs

Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM

AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates

A Systematic DNN Weight Pruning Framework Using Alternating Direction Method of Multipliers

A Dynamic Pruning Method on Multiple Sparse Structures in Deep Neural Networks

Reweighted Alternating Direction Method of Multipliers for DNN weight pruning

A systematic DNN weight pruning framework based on symmetric accelerated stochastic ADMM

A Unified Framework of DNN Weight Pruning and Weight Clustering/Quantization Using ADMM

SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency

Efficient Micro-Structured Weight Unification and Pruning for Neural Network Compression

Tiny but Accurate: A Pruned, Quantized and Optimized Memristor Crossbar Framework for Ultra Efficient DNN Implementation

Adversarial Structured Neural Network Pruning

A Pruning Method Based on the Dissimilarity of Angle among Channels and Filters

A systematic network pruning framework using ADMM-type algorithm

Online Filter Weakening and Pruning for Efficient Convnets