Flexi-Compression: A Flexible Model Compression Method for Autonomous Driving

Hantao Liu, Ying He, F. Richard Yu, Jeremy James
DOI: https://doi.org/10.1145/3479243.3487300
2021-01-01
Abstract: Benefiting from the rapid development of convolutional neural networks, computer vision-based autonomous driving technologies are gradually being deployed in vehicles. However, these neural networks typically have a large number of parameters and an extremely high computational cost, making them difficult to deploy in autonomous vehicles with limited storage and computational power. In this paper, we propose an innovative model compression approach, which we call Flexi-Compression, to compress the convolutional neural networks used in autonomous driving algorithms. Flexi-Compression first modifies the model structure by replacing the traditional convolutional layers with our proposed Flexi-CP module, thus reducing the computation of the convolutional layers. Then, we leverage knowledge distillation to enable the compressed model to quickly acquire the knowledge of the original model. In addition, we use a Flexi-Batch Normalization layer to prune the model, and finally we further reduce the model size by model quantization. Using our proposed algorithm, we compress the VGG-16 network, a commonly used backbone network in autonomous driving algorithms. On the CIFAR-10 dataset, our compression method reduces the parameters of the VGG-16 network by 86% and the computation by 87%, with a 4% loss of accuracy. To verify the effectiveness of our compression algorithm in real-world applications, we also compress an autonomous driving algorithm and achieve excellent performance.
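The knowledge distillation step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the loss that Flexi-Compression actually uses, so the code below assumes the standard soft-target formulation: a temperature-scaled KL term between the teacher (original model) and the student (compressed model), combined with hard-label cross-entropy. The function names and the `temperature` and `alpha` values are hypothetical choices for illustration, not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Soft-target KL term plus hard-label cross-entropy.

    Assumption: standard soft-target distillation, not necessarily
    the loss used by Flexi-Compression. `temperature` and `alpha`
    are hypothetical hyperparameters.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL(teacher || student) on the softened distributions
    kl = sum(pt * math.log(pt / ps)
             for pt, ps in zip(p_teacher, p_student) if pt > 0)
    # Cross-entropy of the student against the ground-truth label
    ce = -math.log(softmax(student_logits)[label])
    # The T^2 factor keeps the soft-term gradient scale comparable
    return alpha * temperature ** 2 * kl + (1 - alpha) * ce


# A student that matches the teacher pays only the cross-entropy term.
teacher = [2.0, 0.5, -1.0]
print(distillation_loss(teacher, teacher, label=0))
print(distillation_loss([-1.0, 0.5, 2.0], teacher, label=0))
```

In this formulation, a larger `temperature` flattens both distributions so that the student also learns from the teacher's relative scores on the incorrect classes, while `alpha` balances imitation of the teacher against fitting the labels.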