Towards Efficient Convolutional Neural Network for Embedded Hardware Via Multi-Dimensional Pruning.

Hao Kong, Di Liu, Xiangzhong Luo, Shuo Huai, Ravi Subramaniam, Christian Makaya, Qian Lin, Weichen Liu
DOI: https://doi.org/10.1109/dac56929.2023.10247965
2023-01-01
Abstract: In this paper, we propose TECO, a multi-dimensional pruning framework to collaboratively prune the three dimensions (depth, width, and resolution) of convolutional neural networks (CNNs) for better execution efficiency on embedded hardware. In TECO, we first introduce a two-stage importance evaluation framework, which efficiently and comprehensively evaluates each pruning unit according to both the local importance inside each dimension and the global importance across different dimensions. Based on the evaluation framework, we present a heuristic pruning algorithm to progressively prune the three dimensions of CNNs towards the optimal trade-off between accuracy and efficiency. Experiments on multiple benchmarks validate the advantages of TECO over existing state-of-the-art (SOTA) approaches. The code and pre-trained models are available anonymously at https://github.com/ntuliuteam/Teco.
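The abstract does not specify how the two evaluation stages or the heuristic are computed, so the following is only an illustrative sketch of the general idea, not TECO's actual algorithm. It assumes a hypothetical `PruningUnit` record with made-up fields (`local_score`, `accuracy_drop`, `flops_saved`), uses "lowest local score" as a stand-in for local importance, and uses "estimated accuracy drop per FLOP saved" as a stand-in for global importance across dimensions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PruningUnit:
    # All fields are hypothetical; the abstract does not define them.
    name: str
    dimension: str        # "depth", "width", or "resolution"
    local_score: float    # assumed importance within its own dimension
    accuracy_drop: float  # assumed accuracy loss if this unit is pruned
    flops_saved: float    # assumed compute saved if pruned (must be > 0)


def local_candidates(units):
    """Stage 1 (assumed): per dimension, keep the unit with the lowest local score."""
    best = {}
    for unit in units:
        current = best.get(unit.dimension)
        if current is None or unit.local_score < current.local_score:
            best[unit.dimension] = unit
    return list(best.values())


def global_choice(candidates):
    """Stage 2 (assumed): across dimensions, pick the smallest accuracy drop per FLOP saved."""
    return min(candidates, key=lambda u: u.accuracy_drop / u.flops_saved)


def progressive_prune(units, flops_target):
    """Greedily prune one unit per step until the assumed FLOPs target is reached."""
    remaining = list(units)
    pruned, saved = [], 0.0
    while remaining and saved < flops_target:
        choice = global_choice(local_candidates(remaining))
        remaining.remove(choice)
        pruned.append(choice.name)
        saved += choice.flops_saved
    return pruned, saved
```

In a realistic setting the scores would presumably be re-estimated after each pruning step (and the network fine-tuned), whereas this sketch keeps them fixed for simplicity.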