Mixed-precision quantized neural networks with progressively decreasing bitwidth

Tianshu Chu,Qin Luo,Jie Yang,Xiaolin Huang
DOI: https://doi.org/10.1016/j.patcog.2020.107647
IF: 8
2021-01-01
Pattern Recognition
Abstract:•We address the trade-off issue between aggressive model compression and the superior performance of quantized neural networks.•Based on the observation on internal feature distributions, a mixed-precision QNN with progressively decreasing bitwidth is proposed.•A heuristic of bitwidth assignment based on the quantitative separability for feature representation is given.•Several typical CNNs including AlexNex, ResNet and Faster R-CNN are quantized based on the proposed mixed-precision method.•The experimental results demonstrate that the mixed-precision networks could achieve preferable performance with less memory space.
What problem does this paper attempt to address?