Deep Face Model Compression Using Entropy-Based Filter Selection.

Bingbing Han, Zhihong Zhang, Chuanyu Xu, Beizhan Wang, Guosheng Hu, Lu Bai, Qingqi Hong, Edwin R. Hancock
DOI: https://doi.org/10.1007/978-3-319-68548-9_12
2017-01-01
Abstract: State-of-the-art face recognition systems are built on deep convolutional neural networks (CNNs). However, these CNNs contain millions of parameters, which makes them difficult to deploy on mobile and embedded devices. One solution is to reduce the size of the trained CNNs by model compression. In this work, we propose an entropy-based pruning metric that reduces the size of intermediate activations so as to accelerate and compress CNN models in both the training and inference stages. First, the importance of each filter in each layer is evaluated by our entropy-based method. Then, unimportant filters are removed according to a predefined compression rate. Finally, we fine-tune the pruned model to improve its discrimination ability. Experiments conducted on the LFW face dataset show the effectiveness of our entropy-based method. We achieve 1.92x compression and 1.88x speed-up on the VGG-16 model, and 2x compression and 1.74x speed-up on the WebFace model, both with only about a 1% decrease in accuracy evaluated on LFW.
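The first two steps described in the abstract (score each filter, then drop the lowest-scoring ones) can be illustrated with a minimal sketch. The abstract does not say how the entropy is computed, so the sketch makes two assumptions: each filter is summarized by one scalar activation per input image, and entropy is estimated from a fixed-width histogram of those scalars, with low entropy taken to mean "unimportant". The names `filter_entropy`, `select_filters`, and `num_bins` are hypothetical, and the fine-tuning step is omitted.

```python
import math

def filter_entropy(values, num_bins=32):
    """Histogram-based entropy estimate of one filter's per-image activations.

    Assumption: `values` holds one scalar per input image for a single filter
    (for example, the spatial mean of its feature map).
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        # A filter that responds identically to every image carries no information.
        return 0.0
    width = (hi - lo) / num_bins
    counts = [0] * num_bins
    for v in values:
        # Clamp so the maximum value falls into the last bin.
        idx = min(int((v - lo) / width), num_bins - 1)
        counts[idx] += 1
    n = len(values)
    return -sum((c / n) * math.log(c / n) for c in counts if c > 0)

def select_filters(entropies, compression_rate):
    """Return the sorted indices of the filters to keep.

    Assumption: `compression_rate` is the fraction of filters to remove,
    and the lowest-entropy filters are removed first.
    """
    num_keep = max(1, int(len(entropies) * (1.0 - compression_rate)))
    ranked = sorted(range(len(entropies)), key=lambda j: entropies[j], reverse=True)
    return sorted(ranked[:num_keep])

# Toy usage: four "filters", each with activations over 100 images.
columns = [
    [0.0] * 100,                    # constant response
    [i / 99 for i in range(100)],   # evenly spread response
    [0.0, 1.0] * 50,                # two-valued response
    [1.0] * 100,                    # constant response
]
scores = [filter_entropy(col) for col in columns]
kept = select_filters(scores, compression_rate=0.5)
```

In a real network the kept indices would be used to slice the layer's weight tensor and the matching input channels of the next layer before fine-tuning; that part depends on the framework and is not shown here.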