Rldr-Pruning: Restricted Linear Dimensionality Reduction Approach For Model Compression

Xiaoru Liu,Jing Wu,Chengnian Long
DOI: https://doi.org/10.1007/978-3-030-36711-4_29
2019-01-01
Abstract:This paper studies pruning a trained deep neural network for resource-constrained devices. In general, pruning a trained CNN is an iterative process: start with a trained CNN; choose and prune the least important parameter; fine-tune to get another trained CNN. It is reasonable to define the least important parameter as the parameter that leads to the minimum accuracy drop after pruning and fine-tuning. However, directly searching such parameter is computationally infeasible because of fine-tuning. Therefore, current methods ignore fine-tuning when choosing parameter to prune. We take fine-tuning into consideration and propose our RLDR-pruning method. To make the searching feasible in our method, we first model the fine-tuning process and propose an one-step process called mini-tuning. Then, our RLDR-pruning replaces fine-tuning with mini-tuning when searching the least important parameter. Via experiments on classification tasks, we demonstrate that RLDR-pruning achieves significantly higher inference accuracy than existing techniques based on similar parameter optimization capabilities.
What problem does this paper attempt to address?