Filter pruning via expectation-maximization

Sheng Xu, Yanjing Li, Linlin Yang, Baochang Zhang, Dianmin Sun, Kexin Liu
DOI: https://doi.org/10.1007/s00521-022-07127-2
2022-01-01
Neural Computing and Applications
Abstract: Redundancy in convolutional neural networks (CNNs) introduces a large number of extra parameters, which increases computation and reduces filter diversity. In this paper, we introduce filter pruning via expectation-maximization (FPEM) to trim redundant structures and improve the diversity of the remaining structures. Our method is based on the observation that the filter diversity of a pruned network is positively correlated with its performance. The expectation step divides the filters of each layer into groups by maximum likelihood and averages the output feature maps within each cluster. The maximization step computes the likelihood estimate of the clusters and formulates a loss function that makes the distributions within the same cluster consistent. After training, the redundant filters within each cluster can be trimmed so that only diverse filters are retained. On CIFAR-10, the pruned networks outperform the corresponding full models. On ImageNet ILSVRC12, FPEM reduces the FLOPs of ResNet-50 by 46.5% with only a 0.36% decrease in Top-1 accuracy, which advances the state of the art. In addition, FPEM shows strong generalization performance on the object detection task.
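The group-then-trim idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' FPEM implementation: it assumes a simplified hard-EM clustering (isotropic Gaussians with equal variance, which reduces to nearest-center assignment) applied directly to flattened filter weights rather than to output feature maps, and it omits the training loss entirely. The function names `cluster_filters_hard_em` and `select_representatives` are hypothetical.

```python
import numpy as np


def cluster_filters_hard_em(weights, num_clusters, num_iters=20, seed=0):
    """Group the filters of one conv layer with a simplified hard EM.

    weights: array of shape (out_channels, in_channels, kH, kW).
    Returns (labels, centers), where centers has shape (num_clusters, D).
    Assumption: equal-variance isotropic Gaussians, so the maximum-likelihood
    assignment is simply the nearest cluster mean.
    """
    rng = np.random.default_rng(seed)
    flat = weights.reshape(weights.shape[0], -1)
    # Initialize the cluster means from distinct, randomly chosen filters.
    init = rng.choice(flat.shape[0], size=num_clusters, replace=False)
    centers = flat[init].copy()
    labels = np.zeros(flat.shape[0], dtype=int)
    for _ in range(num_iters):
        # E-step: assign each filter to its most likely (nearest) cluster.
        dists = ((flat[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # M-step: re-estimate each cluster mean from its current members.
        for k in range(num_clusters):
            members = flat[labels == k]
            if len(members) > 0:
                centers[k] = members.mean(axis=0)
    return labels, centers


def select_representatives(weights, labels, centers):
    """Keep one filter per non-empty cluster: the one closest to the mean."""
    flat = weights.reshape(weights.shape[0], -1)
    keep = []
    for k in range(centers.shape[0]):
        idx = np.flatnonzero(labels == k)
        if idx.size == 0:
            continue  # an empty cluster contributes no filter
        d = ((flat[idx] - centers[k]) ** 2).sum(axis=1)
        keep.append(int(idx[d.argmin()]))
    return sorted(keep)


# Usage on random weights standing in for a trained layer (illustrative only).
rng = np.random.default_rng(0)
weights = rng.normal(size=(16, 3, 3, 3))
labels, centers = cluster_filters_hard_em(weights, num_clusters=4)
keep = select_representatives(weights, labels, centers)
pruned = weights[keep]  # at most 4 filters remain
print(pruned.shape)
```

In this sketch the number of clusters directly sets the upper bound on retained filters per layer. The method in the abstract additionally trains the network so that filters within a cluster become consistent before trimming, which this example does not attempt.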