Learning Sparse Patterns in Deep Neural Networks

Weijing Wen, Fan Yang, Yangfeng Su, Dian Zhou, Xuan Zeng
DOI: https://doi.org/10.1109/asicon47005.2019.8983429
2019-01-01
Abstract: Accelerating deeper neural networks on resource-constrained devices is hindered by their high demand for computation and memory storage. In this paper, we propose a novel regularization method to learn hardware-friendly sparse structures for deep convolutional neural networks. A limited number of fixed sparse patterns are learned through the regularization, and the filters are represented by these fixed sparse patterns. Our regularization method can produce compact models with a higher degree of sparsity than the state-of-the-art structured sparsity learning method. The limited number of fixed sparse patterns also facilitates hardware implementation. Our experiments on several deep neural networks with image classification tasks show that our proposed approach can achieve a remarkably higher degree of sparsity with an acceptable loss of accuracy.
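The abstract does not spell out the regularizer, so the following is only a minimal sketch of the representation idea it describes: each filter kernel is expressed through one of a small, fixed set of sparse patterns. The pattern library `PATTERNS`, the helper names `project_kernel` and `sparsity`, and the rule of picking the pattern that retains the most squared weight magnitude are all assumptions made for illustration, not the authors' method (the paper learns its patterns through regularization).

```python
# Hypothetical library of fixed sparse patterns for a flattened 3x3 kernel
# (indices 0..8 in row-major order). These patterns are invented for
# illustration; the paper learns its patterns during training.
PATTERNS = [
    (0, 4, 8),  # main diagonal
    (2, 4, 6),  # anti-diagonal
    (1, 4, 7),  # centre column
    (3, 4, 5),  # centre row
]


def project_kernel(kernel, patterns=PATTERNS):
    """Return (pattern_id, sparse_kernel) for one flattened kernel.

    Assumed selection rule: keep the pattern whose positions hold the
    largest sum of squared weights; zero every other position.
    """
    def kept_energy(pattern):
        return sum(kernel[i] ** 2 for i in pattern)

    best = max(range(len(patterns)), key=lambda k: kept_energy(patterns[k]))
    keep = set(patterns[best])
    sparse = [w if i in keep else 0.0 for i, w in enumerate(kernel)]
    return best, sparse


def sparsity(kernels):
    """Fraction of weights that are exactly zero across all kernels."""
    total = sum(len(k) for k in kernels)
    zeros = sum(1 for k in kernels for w in k if w == 0.0)
    return zeros / total


if __name__ == "__main__":
    kernel = [0.9, 0.1, 0.0, 0.2, 0.8, 0.1, 0.0, 0.1, 0.7]
    pattern_id, sparse_kernel = project_kernel(kernel)
    print(pattern_id, sparse_kernel, sparsity([sparse_kernel]))
```

Because every kernel is described by a pattern index plus the few weights that pattern keeps, hardware only has to support the handful of access patterns in the library, which is the sense in which a limited pattern set can ease implementation.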