A Model Compression Method Using Significant Data and Knowledge Distillation

Fan Zhang, Beibei Xu, Kai Cui
DOI: https://doi.org/10.1109/ipccc59175.2023.10253832
2023-01-01
Abstract: As neural network models grow more complex, their running cost rises, and training cost increases exponentially with model size. Therefore, this paper proposes a method for compressing Convolutional Neural Network (CNN) models that combines data selection and knowledge distillation, so that the performance of the original model is maintained at low computational cost. (1) For data selection, we propose an algorithm that ranks training data by importance using influence scores and, based on the ranking, selects the training data that contribute most to model performance as the new training set. (2) We propose a knowledge distillation method for low data volumes: the selected important training data are fed to the teacher model to generate knowledge that guides the training of the student model, reducing the running cost while maintaining the performance of the teacher model. This method also effectively avoids the overfitting that occurs when the data volume is too small. By compressing both the data and the model, the training and running costs of the model can be effectively reduced.
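
The two steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how influence scores are computed or which distillation loss is used, so the scores are assumed to be precomputed (one number per training example), and the loss shown is the common temperature-softened formulation. The names `select_top_fraction`, `distillation_loss`, `temperature`, and `alpha` are hypothetical.

```python
import math


def select_top_fraction(scores, fraction):
    """Return indices of the highest-scoring examples, best first.

    `scores` is assumed to hold one precomputed influence score per
    training example; how the paper computes them is not shown here.
    """
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    keep = max(1, int(len(scores) * fraction))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:keep]


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over logits divided by the temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Assumed loss for one example: soft-target KL term plus hard-label term.

    loss = alpha * T^2 * KL(teacher || student) + (1 - alpha) * cross-entropy
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(p * math.log(p / q)
             for p, q in zip(p_teacher, p_student) if p > 0)
    hard = -math.log(softmax(student_logits)[label])
    return alpha * temperature ** 2 * kl + (1 - alpha) * hard


# Usage: keep the top 40% of five examples, then score one student prediction.
scores = [0.1, 0.9, -0.3, 0.5, 0.0]
selected = select_top_fraction(scores, 0.4)
loss = distillation_loss([1.0, 0.2, -0.5], [2.0, 0.1, -1.0], label=0)
```

In this sketch only the examples in `selected` would be passed through the teacher, which is where the reduction in training cost would come from. The weighting `alpha` and the `temperature` are tuning choices, not values taken from the paper.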