Pipeline Design of Nonvolatile-based Computing in Memory for Convolutional Neural Networks Inference Accelerators

Lixia Han, Peng Huang, Zheng Zhou, Yiyang Chen, Haozhang Yang, Xiaoyan Liu, Jinfeng Kang
DOI: https://doi.org/10.23919/date58400.2024.10546661
2024-01-01
Abstract: Nonvolatile-based computing-in-memory inference chips show great potential for accelerating convolutional neural networks. Their intrinsic weight-stationary characteristic makes pipeline design a crucial means of further enhancing throughput. In this work, we propose a balanced pipeline design and establish performance and area evaluation models to identify the optimal pipeline solution. The evaluation results indicate that our pipeline design achieves a 30x improvement in computational efficiency.
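To make the idea of a balanced pipeline concrete, the following is a minimal sketch and not the authors' method or evaluation model. It rests on two assumptions that the abstract does not state: that a layer's per-image latency is proportional to its number of output positions divided by the number of replicated weight copies, and that each copy costs a fixed number of memory arrays. Under those assumptions, pipeline throughput is set by the slowest stage, so a simple greedy rule gives extra weight copies to the current bottleneck layer until a hypothetical array budget is exhausted. All names and numbers (`output_positions`, `arrays_per_copy`, `array_budget`) are illustrative.

```python
import math


def stage_latencies(output_positions, copies):
    """Cycles each layer needs per image, assuming (hypothetically) that
    replicating a layer's weights `c` times divides its work evenly."""
    return [math.ceil(n / c) for n, c in zip(output_positions, copies)]


def balance_pipeline(output_positions, arrays_per_copy, array_budget):
    """Greedy balancing sketch: repeatedly add one weight copy to the
    slowest layer while the array budget allows it."""
    copies = [1] * len(output_positions)
    used = sum(arrays_per_copy)  # one copy of every layer is mandatory
    while True:
        lat = stage_latencies(output_positions, copies)
        slowest = max(range(len(lat)), key=lat.__getitem__)
        # Stop when the bottleneck cannot be improved or afforded.
        if lat[slowest] == 1 or used + arrays_per_copy[slowest] > array_budget:
            break
        copies[slowest] += 1
        used += arrays_per_copy[slowest]
    return copies, used


# Illustrative three-layer network (made-up numbers, not from the paper).
output_positions = [1024, 256, 64]  # early layers have larger feature maps
arrays_per_copy = [1, 2, 4]         # later layers hold more weights
copies, used = balance_pipeline(output_positions, arrays_per_copy, array_budget=16)
print(copies, used, max(stage_latencies(output_positions, copies)))
```

In this toy case the unbalanced pipeline is limited by the first layer at 1024 cycles per image; after balancing, the bottleneck stage drops to 128 cycles at the cost of 16 arrays instead of 7. This throughput-versus-area trade-off is the kind of question the performance and area evaluation models mentioned in the abstract are meant to answer.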