An Efficient Model Compression Method for CNN Based Object Detection

Liuchen Qian,Yuzhuo Fu,Ting Liu
DOI: https://doi.org/10.1109/icsess.2018.8663809
2018-01-01
Abstract:Object detection algorithms like Faster R-CNN and YOLO have demonstrated excellent results on datasets such as the PASCAL VOC and COCO. However, these algorithms need powerful GPUs which makes them infeasible in embedded system because of the huge number of FLOPS and parameters. This paper uses two techniques to solve this problem: depthwise separable convolution and filter pruning. Depthwise separable convolution has been proven to be an efficient technique for reducing FLOPS and parameters in some object detection networks. Filter pruning can also reduce FLOPS and parameters significantly while regaining close to the original accuracy in image classification tasks. This paper also optimizes the pruning method which can greatly reduce pruning time by changing step size according to the sensitivity of each layer. In this work, YOLOv3-tiny network will be modified by depthwise separable convolutions and filter pruning. The results show that FLOPS drops 54%, parameters and model size drop 71 % while mAP just decreases 1.85%.
What problem does this paper attempt to address?