Application of Model Compression and Acceleration Algorithms in Complex Task Model.

Zhaohui lv,Yuanyan Xie,Yang,Yu Guo,Zhenqiang Mi
DOI: https://doi.org/10.1145/3373509.3373545
2019-01-01
Abstract:Nowadays, the rapid development of artificial intelligence has brought great progress to computer vision (CV) technology. As the number of layers in the neural network model continues to deepen, the accuracy of the model is constantly improving, but the computational complexity of the model becomes larger and larger, and the model needs to be trained on GPU or even TPU. The huge amount of com- putation limits the many techniques of deep learning that can only be moved to the cloud and cannot be applied on mobile devices. In order to alleviate this problem, many scholars have proposed a variety of model compression and acceleration algorithms, mod- el pruning is one of the classic algorithms. As a common model compression and acceleration algorithms, model pruning can re- duce network complexity and computation effectively. However, the pruning algorithm generally has a long training time, and the accuracy of the model decreases too fast when the complex task is pruned. In view of this, this paper proposes two improved schemes, which can effectively alleviate the problem that the accuracy of model degradation is too fast, and greatly shorten the training time. When experimenting on a complex task such as the image caption model, the accuracy of the model can remain unchanged basically when the computational complexity of the model is reduced by half.
What problem does this paper attempt to address?