Abstract:In recent years, the scaling down that Moore’s Law relies on has been gradually slowing down, and the traditional von Neumann architecture has been limiting the improvement of computing power. Thus, neuromorphic in-memory computing hardware has been proposed and is becoming a promising alternative. However, there is still a long way to make it possible, and one of the problems is to provide an efficient, reliable, and achievable neural network for hardware implementation. In this paper, we proposed a two-layer fully connected spiking neural network based on binary MRAM (Magneto-resistive Random Access Memory) synapses with low hardware cost. First, the network used an array of multiple binary MRAM cells to store multi-bit fixed-point weight values. This helps to simplify the read/write circuit. Second, we used different kinds of spike encoders that ensure the sparsity of input spikes, to reduce the complexity of peripheral circuits, such as sense amplifiers. Third, we designed a single-step learning rule, which fit well with the fixed-point binary weights. Fourth, we replaced the traditional exponential Leak-Integrate-Fire (LIF) neuron model to avoid the massive cost of exponential circuits. The simulation results showed that, compared to other similar works, our SNN with 1184 neurons and 313,600 synapses achieved an accuracy of up to 90.6% in the MNIST recognition task with full-resolution (28 × 28) and full-bit-depth (8-bit) images. In the case of low-resolution (16 × 16) and black-white (1-bit) images, the smaller version of our network with 384 neurons and 32,768 synapses still maintained an accuracy of about 77%, extending its application to ultra-low-cost situations. Both versions need less than 30,000 samples to reach convergence, which is a >50% reduction compared to other similar networks. As for robustness, it is immune to the fluctuation of MRAM cell resistance.

A Compact and Configurable Long Short-Term Memory Neural Network Hardware Architecture.

A Highly Configurable 7.62gop/s Hardware Implementation for LSTM

DaDianNao: A Machine-Learning Supercomputer

A Hardware Implementation of SNN-Based Spatio-Temporal Memory Model.

A 3.89-Gops/mw Scalable Recurrent Neural Network Processor with Improved Efficiency on Memory and Computation

Long short-term memory networks in memristor crossbar arrays

Long short-term memory networks in memristor crossbars

Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM

Long Short-Term Memory Implementation Exploiting Passive RRAM Crossbar Array

A Power-Efficient Accelerator Based on FPGAs for LSTM Network

Recurrent Neural Networks Hardware Implementation on FPGA

FPGA-based Accelerator for Long Short-Term Memory Recurrent Neural Networks

A Scatter-and-Gather Spiking Convolutional Neural Network on a Reconfigurable Neuromorphic Hardware

A Low-Cost Hardware-Friendly Spiking Neural Network Based on Binary MRAM Synapses, Accelerated Using In-Memory Computing

Implementation and Optimization of the Accelerator Based on FPGA Hardware for LSTM Network

A Cost-Efficient High-Speed VLSI Architecture for Spiking Convolutional Neural Network Inference Using Time-Step Binary Spike Maps

A High Performance Reconfigurable Hardware Architecture for Lightweight Convolutional Neural Network

TripleBrain: An Edge Neuromorphic Architecture for High-accuracy Single-layer Spiking Neural Network with On-chip Self-organizing and Reinforcement Learning

Spatial-Temporal Hybrid Neural Network With Computing-in-Memory Architecture

A Fast and Power Efficient Architecture to Parallelize LSTM based RNN for Cognitive Intelligence Applications.

Memristive LSTM network hardware architecture for time-series predictive modeling problem