Work-in-Progress: Toward Energy-efficient Near STT-MRAM Processing Architecture for Neural Networks

Yueting Li,Bingluo Zhao,Xinyi Xu,Yundong Zhang,Jun Wang,Weisheng Zhao
DOI: https://doi.org/10.1109/codes-isss55005.2022.00013
2022-01-01
Abstract:The size of parameters in artificial neural network (NN) applications grows quickly from a handful to the GB-level. The data transmission poses a key challenge for NN, and either neuron is removed or data compression reduces pressure on memory access but cannot successfully decrease data traffic. Therefore, we propose the near spin-transfer-torque magnetic random processing architecture for developing energy-efficient NNs. Our approach provides system architects with a preliminary scheme to obtain real-time transmission that near memory controller directly compresses non-zero elements, and encodes the corresponding index depending on the kernel size. Furthermore, it adjusts the number of multiplication accumulators and avoids unnecessary hardware overheads during computation. The preliminary experimental results demonstrated this design verified with weights that currently achieve up to 3.05x speedup and 29.6% power compared with the unoptimized one.
What problem does this paper attempt to address?