HARNS: High-level Architectural Model of RRAM Based Computing-in-memory NPU

Yanhai Jiang,Bin Gao,Jianshi Tang,Dabin Wu,Hu He,He Qian,Huaqiang Wu
DOI: https://doi.org/10.1109/icta53157.2021.9661827
2021-01-01
Abstract:As the research on deep neural networks booms, there is broad interest in the accelerator with RRAM-based computing-in-memory architecture to tackle the problem of "memory wall". However, the simulator for large-scale RRAM-based SoC, expected to provide detailed performance analysis, remains unexplored. In this paper, we develop HARNS in C++ and SystemC, an architectural model of RRAM-based neural-processing-unit. HARNS proposes a flexible structure for users to customize the architecture, and provides cycle-accurate performance simulation for design space exploration. A 7-layer convolution network is demonstrated on HARNS, followed by some optimization schemes of architecture and mapping.
What problem does this paper attempt to address?