ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning

Xiaolong Ma,Geng Yuan,Sheng Lin,Zhengang Li,Hao Sun,Yanzhi Wang
DOI: https://doi.org/10.48550/arXiv.1905.00136
2019-05-01
Abstract:The state-of-art DNN structures involve high computation and great demand for memory storage which pose intensive challenge on DNN framework resources. To mitigate the challenges, weight pruning techniques has been studied. However, high accuracy solution for extreme structured pruning that combines different types of structured sparsity still waiting for unraveling due to the extremely reduced weights in DNN networks. In this paper, we propose a DNN framework which combines two different types of structured weight pruning (filter and column prune) by incorporating alternating direction method of multipliers (ADMM) algorithm for better prune performance. We are the first to find non-optimality of ADMM process and unused weights in a structured pruned model, and further design an optimization framework which contains the first proposed Network Purification and Unused Path Removal algorithms which are dedicated to post-processing an structured pruned model after ADMM steps. Some high lights shows we achieve 232x compression on LeNet-5, 60x compression on ResNet-18 CIFAR-10 and over 5x compression on AlexNet. We share our models at anonymous link <a class="link-external link-http" href="http://bit.ly/2VJ5ktv" rel="external noopener nofollow">this http URL</a>.
Machine Learning,Artificial Intelligence,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve efficient weight pruning in deep neural networks (DNN) to reduce computational and storage requirements while maintaining high accuracy. Specifically, the paper focuses on techniques that combine different types of structured sparsity (such as filter pruning and column pruning) in extreme structured pruning. These techniques are very effective in reducing the number of weights, but at the same time also bring the problem of accuracy loss. To solve these problems, the authors propose an optimization framework combined with the Alternating Direction Method of Multipliers (ADMM), as well as the Network Purification and Unused Path Removal (P - RM) algorithms for post - processing the model after ADMM - structured pruning, further improving the compression rate and maintaining network accuracy. The main contributions of the paper include: 1. Using the ADMM method to effectively optimize non - convex problems and successfully applying it to structured weight pruning. 2. Designing new Network Purification and Unused Path Removal (P - RM) algorithms, focusing on post - processing ADMM - structured pruning models to improve the compression rate while maintaining accuracy. Through these methods, the authors achieved significant model compression effects on multiple benchmark datasets. For example, a compression rate of 232 times was achieved on LeNet - 5, a compression rate of 60 times was achieved on ResNet - 18 (CIFAR - 10), and a compression rate of more than 5 times was also achieved on AlexNet, and none of these compressions led to significant accuracy degradation.