Compression for Text Detection and Recognition Based on Low Bit-Width Quantization

Shuo Feng, Jian Cao, Yingbiao Luo, Zhenyuan Dai, Ying Zhang, Yuan Wang
DOI: https://doi.org/10.1109/icsip49896.2020.9339320
2020-01-01
Abstract: In recent years, neural networks have driven significant breakthroughs in text detection and recognition. However, large-scale deep neural networks require substantial storage space and computing resources, which makes them difficult to deploy effectively on many hardware platforms. To address this problem, we quantize text detection and recognition models from floating-point to low bit-width. Using the low bit-width quantization method from DoReFa-Net, we quantize the text detection network, Connectionist Text Proposal Network (CTPN), and the text recognition network, Convolutional Recurrent Neural Network (CRNN). Comparing results across different quantization bit-widths, we find that after quantizing the weights and activations of CTPN to 2 bits, the F-score drops by only 0.09 on the MTWI 2018 Dataset. When the weights and activations of CRNN are quantized to 4 bits, the accuracy decrease when training on the Synthetic Chinese String Dataset is merely 0.1%. After compression, the size of the text detection and recognition model is reduced to only 32% of the original, a 68% saving in storage space.
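The abstract names the DoReFa-Net quantization method but does not spell it out. As a rough illustration, the following sketch shows the forward-pass k-bit quantizers as they are commonly described for DoReFa-Net: weights are squashed with tanh, rescaled to [0, 1], rounded to 2^k levels, and mapped back to [-1, 1]; activations are clipped to [0, 1] and rounded. The function names are hypothetical, and the exact configuration the authors applied to CTPN and CRNN is an assumption, not something the abstract confirms.

```python
import math

def quantize_k(r, k):
    """Round r in [0, 1] to the nearest of 2**k evenly spaced levels."""
    n = 2 ** k - 1
    # floor(x + 0.5) gives round-half-up, avoiding Python's banker's rounding
    return math.floor(r * n + 0.5) / n

def quantize_weights(weights, k):
    """DoReFa-style k-bit weight quantization (forward pass only, assumed form)."""
    t = [math.tanh(w) for w in weights]
    # Fall back to 1.0 if every weight is zero, to avoid dividing by zero
    max_abs = max(abs(v) for v in t) or 1.0
    # Rescale tanh(w) into [0, 1], quantize, then map back to [-1, 1]
    return [2 * quantize_k(v / (2 * max_abs) + 0.5, k) - 1 for v in t]

def quantize_activations(acts, k):
    """Clip activations to [0, 1], then quantize to k bits (assumed form)."""
    return [quantize_k(min(max(a, 0.0), 1.0), k) for a in acts]

# Example: 2-bit quantization leaves at most 4 distinct values per tensor
w_q = quantize_weights([-1.0, 0.0, 0.3, 1.0], k=2)
a_q = quantize_activations([-0.5, 0.2, 0.5, 1.7], k=2)
```

This covers only the forward computation. Training a quantized network additionally needs a gradient rule for the non-differentiable rounding step (DoReFa-Net uses a straight-through estimator), which is omitted here.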