When Sparsity Meets Dynamic Convolution

Shwai He,Yuhang Li,Chenbo Jiang,Shi Gu
DOI: https://doi.org/10.48550/arXiv.2204.02227
2022-01-01
Abstract:Dynamic convolution achieves a substantial performance boost for efficient CNNs at a cost of increased convolutional weights. Contrastively, mask-based unstructured pruning obtains a lightweight network by removing redun-dancy in the heavy network at risk of performance drop. In this paper, we propose a new framework to coherently integrate these two paths so that they can complement each other compensate for the disadvantages. We first design a binary mask derived from a learnable threshold to prune static kernels, significantly reducing the parameters and computational cost but achieving higher performance in Imagenet-1K(0.6% increase in top-1 accuracy with 0.67G fewer FLOPs). Based on this learnable mask, we further propose a novel dynamic sparse network incorporating the dynamic routine mechanism, which exerts much higher accuracy than baselines ( 2 . 63% increase in top-1 accuracy for MobileNetV1 with 90% sparsity). As a result, our method demonstrates a more efficient dynamic convolution with sparsity.
What problem does this paper attempt to address?