PAICORE: A 1.9-Million-neuron 5.181-TSOPS/W Digital Neuromorphic Processor with Unified SNN-ANN and On-Chip Learning Paradigm
Yi Zhong,Yisong Kuang,Kefei Liu,Zilin Wang,Shuo Feng,Guang Chen,Youming Yang,Xiuping Cui,Qiankun Wang,Jian Cao,Song Jia,Yun Liang,Guangyu Sun,Xiaoxin Cui,Ru Huang,Yuan Wang
DOI: https://doi.org/10.1109/jssc.2024.3426319
IF: 5.4
2024-01-01
IEEE Journal of Solid-State Circuits
Abstract:The neuromorphic approach of fulfilling brain-like edge intelligence is confronted with three paramount challenges: 1) ever-increasing application demands versus insufficient on-chip resources; 2) diverse data sources and models versus heterogeneous computing paradigms; and 3) adaptation to real scenarios versus absence of on-chip learning. To bridge those gaps in the current research community, this article presents PAICORE, a 537.98-mm(2) digital neuromorphic processor with a unified computing and learning paradigm. PAICORE is a scalable 1024-core design, integrating over 1.919 million neurons and 4.773 billion synapses on a single chip. PAICORE is implemented in globally asynchronous locally synchronous (GALS) style with a five-level fat up-down quadtree as dedicated network-on-chip (NoC) infrastructure and tiled 2-D chip array as inter-chip architecture. The distributed processing cores fuse the hybrid spiking neural network (SNN), artificial neural network (ANN), and on-chip spike-timing-dependent plasticity (STDP) learning models into a unified description by modular-level reconfiguration. PAICORE achieves the peak performance of 20.74 TSOPS, 41.49 TOPS, and 115.25 GSOPS, with the best energy efficiency of 5.181 TSOPS/W, 10.372 TOPS/W, and 1.222 TSOPS/W for SNN, ANN, and on-chip learning paradigms, respectively. Benefiting from the PAIFLOW software framework, PAICORE hardware platform can be equivalently simulated and efficiently programmed in a large-scale deployment, in accordance with specific optimizing demands for its target multi-paradigm tasks.