Translational Bit-by-Bit Multi-bit Quantization for CRNN on Keyword Spotting.

Bo Liu,Yuhao Sun
DOI: https://doi.org/10.1109/cyberc.2019.00082
2019-01-01
Abstract:Keyword spotting (KWS) is one of automatic speech recognition (ASR) research fields. KWS algorithms based on neural network are remarkably better than others. It is extremely suitable for KWS algorithms to deploy on portable hardware devices to achieve excellent performance, but the number of weights is too large to apply due to limited hardware storage resources. Quantization is a natural approach to solve this puzzle. We propose a quantization formula to deploy KWS algorithms based on neural network on hardware devices effectively. In this paper, we consider symmetry of weights about the origin, and reorganize the order of 1-bit quantization of residual error and alternating quantization of minimization, proposing our translational bit-by-bit multi-bit quantization. Furthermore, we implement our method to convolutional recurrent neural network (CRNN) for KWS problem, analyzing results on two aspects, root mean square error (RMSE) and accuracy respectively. We can obtain smaller RMSE between quantized weights and full precision weights, the cost of it is the increment of computational complexity, but we can execute quantization in every mini-batch to constrain training time within a reasonable range. Our formula can get an acceptable accuracy by 3-bit quantization, with ~10.65× memory saving. And we can obtain no loss accuracy comparing to full precision by 4-bit quantization, sometimes even better, with ~7.99× memory saving. Our quantization method can achieve better performance and save memory resources, which are able to be used to deploy KWS algorithms based on neural network on hardware devices effectively.
What problem does this paper attempt to address?