Deep Neural Architecture Pruning

Yanan Sun,Gary G. Yen,Mengjie Zhang
DOI: https://doi.org/10.1007/978-3-031-16868-0_15
2022-01-01
Abstract:Various techniques have been proposed that take into account both the scale of parameters and performance [1–3]. Analysis of sensitivity or testing in a variety of configurations are commonly used to obtain a suitable compromise between model size and performance. Meanwhile, because convolutional operations take the majority of computing time, it has been investigated that pruning convolutional filters is a potentially promising method for striking a balance between the performance and the scale of parameters. As a result, several criteria for pruning filters in CNNs have been suggested [4–6] and implemented [7].
What problem does this paper attempt to address?