Symmetric Quantization Scheme and Approximate Batch Normalization for Quantized Neural Networks Training

Xiangyu Wu, Zhisong Bie, Xuehong Lin
DOI: https://doi.org/10.1088/1742-6596/1176/3/032024
2019-01-01
Abstract: With advances in hardware circuit design and manufacturing processes, Deep Neural Networks (DNNs) have grown deeper and larger and achieve remarkable results in many areas. Despite this, excessive memory usage and heavy resource consumption remain unavoidable. We propose a symmetric quantization scheme, which quantizes the weights of a given kernel to values symmetric about zero, together with a scaling factor and a bias term, in order to make full use of the whole quantization space and reduce the number of multiplications in convolution. We also introduce an Approximate Batch Normalization (ABN) method, which simplifies traditional BN to lightweight linear operations. In addition, activations and gradients are quantized in our scheme with fine distinctions, so that most floating-point calculations can be converted into fixed-point calculations and bit-shift operations in both the training and inference phases. In theory, the proposed scheme greatly alleviates the problems mentioned above, requiring fewer computations and less storage. We carried out various experiments to demonstrate its feasibility.
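The abstract does not give the exact quantization rule, so the following is only a minimal sketch of what a quantizer that is symmetric about zero and carries a scaling factor and bias term might look like. The function names, the choice of odd-integer codes, the use of the kernel mean as the bias, and the max-abs scaling are all illustrative assumptions, not the authors' method.

```python
import math


def symmetric_quantize(weights, bits=2):
    """Quantize one kernel's weights (a flat list) to 2**bits symmetric levels.

    Hypothetical scheme: w is approximated by bias + scale * q, where q is an
    odd integer in {-(2**bits - 1), ..., -1, 1, ..., 2**bits - 1}. This code
    set has no zero level, so all 2**bits codes are used and the levels are
    mirrored around zero.
    """
    if not weights:
        raise ValueError("weights must be non-empty")
    max_code = 2 ** bits - 1
    # Assumption: the bias term is the kernel mean, which centers the weights.
    bias = sum(weights) / len(weights)
    centered = [w - bias for w in weights]
    # Assumption: the scale maps the largest magnitude onto the largest code.
    max_abs = max(abs(c) for c in centered)
    scale = max_abs / max_code if max_abs > 0 else 1.0
    codes = []
    for c in centered:
        q = 2 * math.floor(c / (2 * scale)) + 1   # nearest odd integer
        q = max(-max_code, min(max_code, q))      # clamp to the code range
        codes.append(q)
    return codes, scale, bias


def dequantize(codes, scale, bias):
    """Reconstruct approximate weights from codes, scale, and bias."""
    return [bias + scale * q for q in codes]


codes, scale, bias = symmetric_quantize([-3.0, -1.0, 1.0, 3.0], bits=2)
approx = dequantize(codes, scale, bias)
```

With `bits=1` the code set reduces to {-1, +1}, so multiplying an activation by a weight becomes a sign flip followed by a single shared multiplication by `scale`. That is one plausible reading of how such a scheme could reduce multiplications during convolution.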