Abstract:Although deep convolutional neural networks (CNNs) have achieved significant success in computer vision applications, the real-world deployment of CNNs is often limited by computing resources and memory constraints. As a mainstream deep model compression technology, neural network pruning offers a promising prospect to reduce models' parameters and calculation. In this paper, we proposed a novel filter pruning method that combines convolution filters and feature maps information for convolutional neural network compression, namely network pruning by using cluster similarity and large eigenvalues (CSHE). First, based on the convolution operation, we explore the similarity relationship of feature maps generated by the corresponding filters. Concretely, the clustering algorithm is used to classify the similarity of filter to guide the classification of feature map. Secondly, the proposed method utilizes the information of the large eigenvalues of the feature maps to rank the importance of filters. Finally, we prune the low-ranking filters and remain the high-ranking ones. The proposed method eliminates redundancy in convolution filters by applying large eigenvalues of feature maps based on filters similarity. In this way, most of the representative information in the network can be retained and the pruned results can be easily reproduced. Experiments show that the accuracy of the pruned sparse deep network obtained by the CSHE method in the classification tasks of CIFAR-10 and ImageNet ILSVRC-12 is almost the same as that of the reference network without any additional constraints.

A Novel Clustering-Based Filter Pruning Method for Efficient Deep Neural Networks.

Pruning by Training: A Novel Deep Neural Network Compression Framework for Image Processing.

Class-Aware Pruning for Efficient Neural Networks

Filter Pruning Via Feature Map Clustering.

Towards Efficient Filter Pruning Via Topology

Batch-Normalization-based Soft Filter Pruning for Deep Convolutional Neural Networks

Towards Efficient Filter Pruning Via Adaptive Automatic Structure Search

Online Filter Clustering and Pruning for Efficient Convnets

Asymptotic Soft Cluster Pruning for Deep Neural Networks

CUP: Cluster Pruning for Compressing Deep Neural Networks

Online Filter Weakening and Pruning for Efficient Convnets

Filter Pruning with a Feature Map Entropy Importance Criterion for Convolution Neural Networks Compressing

Pruning the Deep Neural Network by Similar Function

CSHE: Network Pruning by Using Cluster Similarity and Matrix Eigenvalues

ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression.

Convolutional neural network acceleration algorithm based on filters pruning

A Pruning Method Based on the Dissimilarity of Angle among Channels and Filters

Dynamic Network Pruning Via Filter Attention Mechanism and Feature Scaling Factor

Pruning filters with L1-norm and standard deviation for CNN compression

A Graphical Approach for Filter Pruning by Exploring the Similarity Relation between Feature Maps

A Compact Parallel Pruning Scheme for Deep Learning Model and Its Mobile Instrument Deployment