Dynamic Network Pruning Via Filter Attention Mechanism and Feature Scaling Factor

Hai-wei LU,Hai-feng XIA,Xiao-tong YUAN
DOI: https://doi.org/10.3969/j.issn.1000-1220.2019.09.005
2019-01-01
Abstract:Structured pruning is an effective way of model compression,which reduces the unimportant filters in the network and re-duces the amount of computation and storage of the network. . However,it is impossible to accurately determine the filter based on the parameter information of the filter itself. A dynamic pruning method is proposed,which uses the attention mechanism of the filter and the BN layer scaling factor to select a redundant filter and crop it. The method has three advantages:1. End-to-end training pruning:training and pruning are performed at the same time and the training speed is faster. 2. Larger optimization space:The training network dynamically adjusts the cropped filter to search for the optimal pruning strategy. 3. More accurate filter selection:Multiple parameter information selects redundant filters to ensure the performance of the network. The experiments were carried out on CIFAR-10 and CI-FAR-100 respectively. The experimental results on the CIFAR-10 dataset showed that the floating point operations of the compressed ResNet56 and ResNet110 were reduced by more than 40% ,but the accuracy was improved.
What problem does this paper attempt to address?