Lookahead: A Far-Sighted Alternative of Magnitude-based Pruning

Sejun Park, Jaeho Lee, Sangwoo Mo, Jinwoo Shin
DOI: https://doi.org/10.48550/arXiv.2002.04809
IF: 5.414
2020-02-12
Machine Learning
Abstract: Magnitude-based pruning is one of the simplest methods for pruning neural networks. Despite its simplicity, magnitude-based pruning and its variants have demonstrated remarkable performance in pruning modern architectures. Based on the observation that magnitude-based pruning minimizes the Frobenius distortion of the linear operator corresponding to a single layer, we develop a simple pruning method, coined lookahead pruning, by extending the single-layer optimization to a multi-layer optimization. Our experimental results demonstrate that the proposed method consistently outperforms magnitude-based pruning on various networks, including VGG and ResNet, particularly in the high-sparsity regime. Code is available at https://github.com/alinlab/lookahead_pruning.
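
The single-layer observation in the abstract can be checked numerically. The NumPy sketch below is illustrative only and is not taken from the authors' repository: `magnitude_mask`, `frobenius_distortion`, and `multilayer_scores` are hypothetical helper names. The first two show that zeroing the smallest-magnitude entries gives the smallest Frobenius distortion for a fixed pruning budget. The third is one plausible reading of a multi-layer extension, assuming three consecutive purely linear layers with no activations; the abstract does not state the exact scoring rule, so treat it as an assumption.

```python
import numpy as np


def magnitude_mask(W, sparsity):
    """Boolean mask that drops the `sparsity` fraction of entries with smallest |w|."""
    n_prune = int(round(sparsity * W.size))
    mask = np.ones(W.size, dtype=bool)
    if n_prune > 0:
        # Indices of the n_prune smallest-magnitude entries in the flattened matrix.
        idx = np.argsort(np.abs(W), axis=None)[:n_prune]
        mask[idx] = False
    return mask.reshape(W.shape)


def frobenius_distortion(W, mask):
    """Frobenius norm of the difference between W and its pruned version."""
    return np.linalg.norm(W - W * mask)


def multilayer_scores(W_prev, W, W_next):
    """Assumed multi-layer score (linear layers only, not confirmed by the abstract).

    Shapes: W_prev (d1, d0), W (d2, d1), W_next (d3, d2).
    Zeroing W[k, j] changes W_next @ W @ W_prev by the rank-one matrix
    W_next[:, k] * W[k, j] * W_prev[j, :], whose Frobenius norm is
    |W[k, j]| * ||W_next[:, k]|| * ||W_prev[j, :]||.
    """
    row_norms = np.linalg.norm(W_prev, axis=1)  # one norm per row j, shape (d1,)
    col_norms = np.linalg.norm(W_next, axis=0)  # one norm per column k, shape (d2,)
    return np.abs(W) * col_norms[:, None] * row_norms[None, :]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W = rng.standard_normal((4, 5))
    mask = magnitude_mask(W, sparsity=0.5)
    print("pruned entries:", int((~mask).sum()))
    print("Frobenius distortion:", frobenius_distortion(W, mask))
```

Under this reading, a weight with small magnitude can still receive a high score when the adjacent row and column it connects carry large norms, which is the sense in which a multi-layer criterion looks further than magnitude alone.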