Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization

Yefei He,Luoming Zhang,Weijia Wu,Hong Zhou
DOI: https://doi.org/10.1007/s11063-023-11338-6
IF: 2.565
2023-01-01
Neural Processing Letters
Abstract:Data-free quantization compresses the neural network to low bit-width without access to original training data. Most existing data-free quantization methods cause severe performance degradation due to inaccurate activation clipping range and quantization error. In this paper, we present a simple yet effective data-free quantization method with accurate activation clipping and adaptive batch normalization. Accurate activation clipping (AAC) improves the model accuracy by exploiting accurate activation information from the full-precision model. Adaptive batch normalization (ABN) firstly proposes to address the quantization error from distribution changes by updating the batch normalization layer adaptively. Extensive experiments demonstrate that the proposed data-free quantization method can yield surprising performance, achieving 64.33% top-1 accuracy of 4-bit ResNet18 on ImageNet dataset, with 3.7% absolute improvement outperforming the existing state-of-the-art methods.
What problem does this paper attempt to address?