A Multi-precision Quantized Super-Resolution Model Framework

Jingyu Liu,Dunbo Zhang,Qiong Wang,Li Shen
DOI: https://doi.org/10.1007/978-3-030-95384-3_22
2022-01-01
Abstract:Equipment’s computing capability has been greatly enhanced at present, which helps deep learning achieve excellent results in various applications, such as super-resolution. However, for higher performance, lower model size and faster computing speed, model compression is widely applied to accomplish the goal. For instance, model quantization is a typical compression method, such as quantization aware training and etc. Quantization aware training can take more quantization loss due to data mapping in model training into account, clamping and approximating the data representation range when updating parameters, which introduces quantization errors into loss function. In the quantization process, we used a quantization strategy that we quantized the model in different stages of combination, and found that some stages of the two super-resolution models’ generators based on SRGAN and ESRGAN showed sensitivity to quantization during the process, which greatly reduced the performance. Therefore, according to the quantization sensitivity, we use higher bits integer quantization for the sensitive stage, and get the multi-precision quantized model. For quantizing the SR model automatically, we propose a multi-precision quantization framework in this paper according to the ratio of input and output channels in every stage in the model. We also have our work tested on eight classical data sets of super-resolution. Generally speaking, both the two models’ PI values approach the original model’s respectively.
What problem does this paper attempt to address?