A 12.1 TOPS/W Quantized Network Acceleration Processor With Effective-Weight-Based Convolution and Error-Compensation-Based Prediction

Huiyu Mo, Wenping Zhu, Wenjing Hu, Qiang Li, Ang Li, Shouyi Yin, Shaojun Wei, Leibo Liu
DOI: https://doi.org/10.1109/JSSC.2021.3113569
IF: 5.4
2022-01-01
IEEE Journal of Solid-State Circuits
Abstract: In this article, a quantized network acceleration processor (QNAP) is proposed to efficiently accelerate CNN processing by eliminating most unessential operations based on algorithm-hardware co-optimizations. First, an effective-weight-based convolution (EWC) is proposed to distinguish a group of effective weights (EWs) to replace the other unique weights. Therefore, the input activations correspo...