Exploring the Granularity of Sparsity in Convolutional Neural Networks.

Huizi Mao,Song Han,Jeff Pool,Wenshuo Li,Xingyu Liu,Yu Wang,William J. Dally
DOI: https://doi.org/10.1109/CVPRW.2017.241
2017-01-01
Abstract:Sparsity helps reducing the computation complexity of DNNs by skipping the multiplication with zeros. The granularity of sparsity affects the efficiency of hardware architecture and the prediction accuracy. In this paper we quantitatively measure the accuracy-sparsity relationship with different granularity. Coarse-grained sparsity brings more regular sparsity pattern, making it easier for hardware acceleration, and our experimental results show that coarse-grained sparsity have very small impact on the sparsity ratio given no loss of accuracy. Moreover, due to the index saving effect, coarse-grained sparsity is able to obtain similar or even better compression rates than fine-grained sparsity at the same accuracy threshold. Our analysis, which is based on the framework of a recent sparse convolutional neural network (SCNN) accelerator, further demonstrates that it saves 30% - 35% of memory references compared with fine-grained sparsity.
What problem does this paper attempt to address?