7.3 A 28nm 38-to-102-tops/w 8b Multiply-Less Approximate Digital SRAM Compute-In-Memory Macro for Neural-Network Inference

Yifan He,Haikang Diao,Chen Tang,Wenbin Jia,Xiyuan Tang,Yuan Wang,Jinshan Yue,Xueqing Li,Huazhong Yang,Hongyang Jia,Yongpan Liu
DOI: https://doi.org/10.1109/isscc42615.2023.10067305
2023-01-01
Abstract:This paper presents a 2-to-8-b scalable digital SRAM-based CIM macro that is co-designed with a multiply-less neural-network (NN) design methodology and incorporates dynamic-logic-based approximate circuits for vector-vector operations. Digital CIMs enable high throughput and reliable matrix-vector multiplications (MVMs); however, digital CIMs face three major challenges to obtain further aggressive gains over conventional digital architectures: (1) prior digital CIMs exploiting approximate computation suffer from accuracy degradation [1]; (2) digital [2] and, as [3] predicted, mixed-signal CIMs [4], suffer from quadratic energy scaling with improving operand precision; (3) the tight and regular memory layout prevent s CIMs from leveraging unstructured bit-level statistics.
What problem does this paper attempt to address?