Channel Pruning for Accelerating Convolutional Neural Networks via Wasserstein Metric.

Haoran Duan,Hui Li
DOI: https://doi.org/10.1007/978-3-030-69535-4_30
2020-01-01
Abstract:Channel pruning is an effective way to accelerate deep convolutional neural networks. However, it is still a challenge to reduce the computational complexity while preserving the performance of deep models. In this paper, we propose a novel channel pruning method via the Wasserstein metric. First, the output features of a channel are aggregated through the Wasserstein barycenter, which is called the basic response of the channel. Then the channel discrepancy based on the Wasserstein distance is introduced to measure channel importance, by considering both the channel’s feature representation ability and the substitutability of the basic responses. Finally, channels with the least discrepancies are removed directly, and the loss in accuracy of the pruned model is regained by fine-tuning. Extensive experiments on popular benchmarks and various network architectures demonstrate that the proposed approach outperforms the existing methods.
What problem does this paper attempt to address?