An Energy-Efficient Computing-in-Memory NN Processor with Set-Associate Blockwise Sparsity and Ping-Pong Weight Update
Jinshan Yue, Yongpan Liu, Xiaoyu Feng, Yifan He, Jingyu Wang, Zhe Yuan, Mingtao Zhan, Jiaxin Liu, Jian-Wei Su, Yen-Lin Chung, Ping-Chun Wu, Li-Yang Hong, Meng-Fan Chang, Nan Sun, Chunmeng Dou, Xueqing Li, Ming Liu, Huazhong Yang
DOI: https://doi.org/10.1109/jssc.2023.3324954
IF: 5.4
2024-01-01
IEEE Journal of Solid-State Circuits
Abstract: Computing-in-memory (CIM) chips have demonstrated the potential for high energy efficiency in low-power neural network (NN) processors. Even with energy-efficient CIM macros, existing system-level CIM chips have not deeply explored sparsity and large models, which limits system energy efficiency. This work presents a CIM NN processor with more thorough sparsity support and a higher utilization rate. Three key innovations are proposed. First, a set-associate blockwise sparsity strategy is designed, which simultaneously saves execution time, power, and storage space. Second, a ping-pong weight update mechanism is proposed for a higher utilization rate, enabling simultaneous execution of CIM and write operations. Third, an efficient CIM macro is implemented with adaptive analog-to-digital converter (ADC) precision for better sparsity utilization and a better performance-accuracy trade-off. The fabricated 65-nm chip achieves a system energy efficiency of 9.5 TOPS/W at 4-bit precision, a $6.25\times$ actual improvement over a state-of-the-art CIM chip. This work also supports high CIM execution accuracy on the ImageNet dataset.
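The utilization benefit of overlapping CIM execution with weight writes can be illustrated with a toy timing model. This is only a sketch under stated assumptions, not the chip's actual scheduler: the function names, the two-bank model, and the per-tile cycle counts are hypothetical and are not taken from the paper. It assumes each weight tile must be fully written before it is computed on, and that in ping-pong mode the next tile is written to the idle bank while the current tile is executing.

```python
def serial_time(compute, write):
    """Total cycles when each tile is written, then computed, with no overlap."""
    return sum(c + w for c, w in zip(compute, write))


def ping_pong_time(compute, write):
    """Total cycles when the next tile's write overlaps the current tile's compute.

    Assumed model: two weight banks; the first write cannot be hidden, and each
    later step takes as long as the slower of (current compute, next write).
    """
    if not compute:
        return 0
    total = write[0]  # the first tile has nothing to overlap with
    for i in range(len(compute)):
        next_write = write[i + 1] if i + 1 < len(compute) else 0
        total += max(compute[i], next_write)
    return total


def utilization(compute, total):
    """Fraction of total cycles spent on CIM computation."""
    return sum(compute) / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical cycle counts for three weight tiles.
    compute = [4, 4, 4]
    write = [2, 2, 2]
    for name, fn in (("serial", serial_time), ("ping-pong", ping_pong_time)):
        t = fn(compute, write)
        print(f"{name}: {t} cycles, utilization {utilization(compute, t):.2f}")
```

In this model the write latency is fully hidden whenever a tile's write is no longer than the preceding tile's compute, so only the first write remains on the critical path.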