An In-Memory-Computing STT-MRAM Macro with Analog ReLU and Pooling Layers for Ultra-High Efficient Neural Network

Linjun Jiang, Sifan Sun, Jinming Ge, He Zhang, Wang Kang
DOI: https://doi.org/10.1109/nvmsa58981.2023.00026
2023-01-01
Abstract: In-memory computing (IMC) technology has great potential for neural network accelerators. However, the energy efficiency of current mainstream IMC designs is limited because the peripheral circuitry (e.g., the ADC) for multiply-and-accumulate (MAC) operations and nonlinear operations (e.g., max pooling and ReLU) is expensive. This paper proposes an in-memory computing STT-MRAM macro with analog ReLU and pooling layers for efficient neural networks. By implementing the activation and max pooling layers in the analog domain, before the ADC, the ADC overhead (both latency and energy) can be significantly reduced. The macro was implemented in an industrial 22 nm process, and the results show that our STT-MRAM IMC macro reduces energy by 2.02x to 2.71x and latency by 1.8x in comparisons across various macro scales.
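The effect of moving ReLU and max pooling ahead of the ADC can be illustrated with a small behavioral sketch. It is not the paper's circuit: the ideal 4-bit quantizer `adc`, the 2x2 pooling window, the 4x4 feature map, and all function names are assumptions chosen for illustration. The sketch only shows that, for a monotonic quantizer, pooling before conversion yields the same codes while needing fewer conversions; the conversion ratio it prints is a property of the assumed window size, not one of the paper's reported figures.

```python
import random

def adc(v, bits=4, v_max=1.0):
    """Hypothetical ideal ADC: clip to [-v_max, v_max], round to a signed code."""
    levels = 2 ** (bits - 1) - 1
    clipped = max(-v_max, min(v_max, v))
    return round(clipped / v_max * levels)

def windows_2x2(fmap):
    """Yield the four values of each non-overlapping 2x2 window."""
    for r in range(0, len(fmap), 2):
        for c in range(0, len(fmap[0]), 2):
            yield [fmap[r][c], fmap[r][c + 1], fmap[r + 1][c], fmap[r + 1][c + 1]]

def digital_post_processing(fmap):
    """Baseline: convert every MAC output, then apply ReLU and max pooling digitally."""
    pooled, conversions = [], 0
    for win in windows_2x2(fmap):
        codes = [adc(v) for v in win]
        conversions += len(codes)
        pooled.append(max(max(code, 0) for code in codes))
    return pooled, conversions

def analog_pre_processing(fmap):
    """Sketch of the analog approach: ReLU and max pooling first, one conversion per window."""
    pooled, conversions = [], 0
    for win in windows_2x2(fmap):
        v = max(max(x, 0.0) for x in win)  # idealized analog ReLU followed by max
        pooled.append(adc(v))
        conversions += 1
    return pooled, conversions

random.seed(0)
# Stand-in for analog MAC outputs of a 4x4 feature map (assumed values).
fmap = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(4)]
digital_out, digital_conv = digital_post_processing(fmap)
analog_out, analog_conv = analog_pre_processing(fmap)
print(digital_out == analog_out, digital_conv, analog_conv)
```

Both paths agree because an ideal quantizer is monotonic and maps zero to zero, so it commutes with both `max` and ReLU. A real analog front end adds noise and offset, which this sketch deliberately ignores.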