A Reconfigurable Computing-in-Memory Accelerator with Dynamic Group-Based Dataflow and Dual-Input Macro Designs

Pufan Xu,Xing Mou,Bin Gao,Qiumeng Wei,Peng Yao,Jianshi Tang,He Qian,Huaqiang Wu
DOI: https://doi.org/10.1109/tcsii.2024.3442873
2024-01-01
Abstract:Non-volatile memory-based computing-in-memory (nvCIM) is a promising candidate for accelerating deep neural networks (DNNs) at the edge. However, current nvCIMs adopt fully-pipelined (FP) or layer-serial (LS) dataflows for all DNN layers, suffering poor area and energy efficiency for the layer-wise-varied workloads. Furthermore, their fixed macro structure results in resource under-utilization, as it is unable to adapt to varying weight sizes. To address these issues, this paper proposes a reconfigurable nvCIM with dynamic dataflow. First, it contains a dynamic inter-pipelined-intra-serial (IPIS) dataflow with group partition mechanism, adapting to the diverse workloads for high area and energy efficiency. Second, it has a dual-input block-reconfigurable (DIBR) macro structure, allowing finer granularity input selection to improve macro utilization and achieve input data reuse. When applied to four well-known networks, the proposed design attains 2.27 11.92 × area efficiency gains and 2.21 14.43 × energy efficiency gains over nvCIM baselines.
What problem does this paper attempt to address?