A Time-Domain Computing-in-Memory based Processor using Predictable Decomposed Convolution for Arbitrary Quantized DNNs

Jianxun Yang,Yuyao Kong,Zhao Zhang,Zhuangzhi Liu,Jing Zhou,Yiqi Wang,Yonggang Liu,Chenfu Guo,Te Hu,Congcong Li,Leibo Liu,Jin Zhang,Shaojun Wei,Jun Yang,Shouyi Yin
DOI: https://doi.org/10.1109/A-SSCC48613.2020.9336145
2020-01-01
Abstract:Time-domain based computing-in-memory (TD-CIM) architectures present superior flexibility, accuracy and scalability for deep neural networks (DNNs). However, there are still three challenges for accelerating multi-bit DNNs due to their inferior bit-split convolution computation: poor network adaptability, massive redundant computations, and high quantization energy and error. This work firstly proposes a unique-weight kernel decomposition based convolution computation method for CIMs to accelerate multi-bit DNNs. Then a TD-CIM based processor is designed with three techniques to further address above challenges: 1) Cross-flipping based fast kernel decomposer to reduce memory accesses for various DNNs, 2) Dual-mode complementary predictor to remove redundant computations, 3) Activation-weight-adaptive pulse quantizer to reduce quantization energy and error. Fabricated in 28nm and evaluated with 1-to-8bit AlexNet and VGG16, the processor achieves a peak energy efficiency of 2.4-to-152.7TOPS/W.
What problem does this paper attempt to address?