A Low Bit-Width Parameter Representation Method for Hardware-Oriented Convolution Neural Networks.

Qiang Chen,Chen Xin,Chenglong Zou,Xinan Wang,Bo Wang
DOI: https://doi.org/10.1109/asicon.2017.8252433
2017-01-01
Abstract:Convolution neural networks (CNNs) have become the state-of-the-art technique for Artificial Intelligence (AI) applications. But they consume resources heavily both in computation and energy. Compressing the parameters to fewer bits is an effective way to reduce data quantity, making it possible to deploy CNNs on mobile, IoT and other resource-limited devices. The existing means need to re-train the entire networks and tune the parameters. This paper presents a convenient method to compress the parameters of trained CNN models without re-training and fine-tuning. Our method reduces the number of bits from 32 to 5-9 under the same networks and on the same dataset, with the loss of accuracy less than 1%. The logic power of a trained LeNet reduces by 70.58% on FPGA evaluation.
What problem does this paper attempt to address?