Neural inference at the frontier of energy, space, and time

Dharmendra S Modha,Filipp Akopyan,Alexander Andreopoulos,Rathinakumar Appuswamy,John V Arthur,Andrew S Cassidy,Pallab Datta,Michael V DeBole,Steven K Esser,Carlos Ortega Otero,Jun Sawada,Brian Taba,Arnon Amir,Deepika Bablani,Peter J Carlson,Myron D Flickner,Rajamohan Gandhasri,Guillaume J Garreau,Megumi Ito,Jennifer L Klamo,Jeffrey A Kusnitz,Nathaniel J McClatchey,Jeffrey L McKinstry,Yutaka Nakamura,Tapan K Nayak,William P Risk,Kai Schleupen,Ben Shaw,Jay Sivagnaname,Daniel F Smith,Ignacio Terrizzano,Takanori Ueda,Dharmendra S. Modha,John V. Arthur,Andrew S. Cassidy,Michael V. DeBole,Steven K. Esser,Peter J. Carlson,Myron D. Flickner,Guillaume J. Garreau,Jennifer L. Klamo,Jeffrey A. Kusnitz,Nathaniel J. McClatchey,Jeffrey L. McKinstry,Tapan K. Nayak,William P. Risk,Daniel F. Smith
DOI: https://doi.org/10.1126/science.adh1174
IF: 56.9
2023-10-21
Science
Abstract:Computing, since its inception, has been processor-centric, with memory separated from compute. Inspired by the organic brain and optimized for inorganic silicon, NorthPole is a neural inference architecture that blurs this boundary by eliminating off-chip memory, intertwining compute with memory on-chip, and appearing externally as an active memory chip. NorthPole is a low-precision, massively parallel, densely interconnected, energy-efficient, and spatial computing architecture with a co-optimized, high-utilization programming model. On the ResNet50 benchmark image classification network, relative to a graphics processing unit (GPU) that uses a comparable 12-nanometer technology process, NorthPole achieves a 25 times higher energy metric of frames per second (FPS) per watt, a 5 times higher space metric of FPS per transistor, and a 22 times lower time metric of latency. Similar results are reported for the Yolo-v4 detection network. NorthPole outperforms all prevalent architectures, even those that use more-advanced technology processes.
multidisciplinary sciences
What problem does this paper attempt to address?