Abstract: Training neural networks (NNs) is usually time-consuming and resource-intensive. Memristors have shown potential for NN computation. In particular, the crossbar structure and multi-bit characteristic of metal-oxide resistive random access memory (RRAM) allow it to perform the matrix-vector product, the most common operation in NNs, with high precision. However, two challenges stand in the way of training NNs in RRAM. First, current architectures support only the inference phase of training and cannot perform backpropagation (BP) or the weight updates of the NN. Second, training requires an enormous number of iterations and constantly updates the weights until convergence, which leads to large energy consumption because of the many write and read operations. In this work, we propose a novel architecture, TIME, together with peripheral circuit designs, to enable NN training in RRAM. TIME supports BP and weight updates while maximizing the reuse of the peripheral circuits used for inference on RRAM. Meanwhile, a variability-free tuning scheme and gradually-write circuits are designed to reduce the cost of tuning RRAM. We explore the performance of both supervised learning (SL) and deep reinforcement learning (DRL) in TIME, and introduce a DRL-specific mapping method to further improve energy efficiency. Experimental results show that, in SL, TIME achieves 5.3x higher energy efficiency on average than the most powerful application-specific integrated circuits (ASICs) in the literature. In DRL, TIME achieves on average 126x higher energy efficiency than a GPU. If the cost of tuning RRAM can be further reduced, TIME has the potential to boost energy efficiency by two orders of magnitude compared with ASICs.
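
As a rough illustration of the crossbar matrix-vector product mentioned in the abstract, the following minimal Python sketch models an idealized crossbar: each cell stores a conductance, input voltages drive the rows, and each column current is the sum of conductance times voltage (Ohm's law plus Kirchhoff's current law). The function names `quantize_to_levels` and `crossbar_mvm`, the number of levels, and the unitless values are assumptions made for illustration; they are not taken from the TIME design, and device non-idealities are ignored.

```python
def quantize_to_levels(w, w_max, levels):
    """Snap a weight in [0, w_max] to one of `levels` evenly spaced values.

    Hypothetical stand-in for the multi-bit characteristic of an RRAM cell.
    """
    step = w_max / (levels - 1)
    return round(w / step) * step


def crossbar_mvm(conductances, voltages):
    """Idealized crossbar read: column current I_j = sum_i G[i][j] * V[i]."""
    n_rows = len(conductances)
    n_cols = len(conductances[0])
    if len(voltages) != n_rows:
        raise ValueError("one input voltage is needed per crossbar row")
    return [
        sum(conductances[i][j] * voltages[i] for i in range(n_rows))
        for j in range(n_cols)
    ]


# Usage: a 2x2 crossbar with arbitrary (unitless) conductances and voltages.
G = [[1.0, 2.0],
     [3.0, 4.0]]
V = [1.0, 0.5]
currents = crossbar_mvm(G, V)  # one output current per column
```

All columns are read out from the same set of row voltages, which is why a single crossbar read corresponds to a full matrix-vector product rather than one multiply-accumulate at a time.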

TIME: A Training-in-Memory Architecture for RRAM-Based Deep Neural Networks

Long Live TIME: Improving Lifetime for Training-in-memory Engines by Structured Gradient Sparsification.

Energy Efficient RRAM Spiking Neural Network for Real Time Classification

RRAM based learning acceleration.

High-Throughput In-Memory Computing for Binary Deep Neural Networks with Monolithically Integrated RRAM and 90nm CMOS

Spiking Neural Network with RRAM: Can We Use It for Real-World Application?

A flexible and fast digital twin for RRAM systems applied for training resilient neural networks

SNrram: an Efficient Sparse Neural Network Computation Architecture Based on Resistive Random-Access Memory.

RRAM-DNN: an RRAM and Model-Compression Empowered All-Weights-On-Chip DNN Accelerator

Training Low Bitwidth Convolutional Neural Network on RRAM

RRAM-based Coprocessors for Deep Learning

Accurate Program/Verify Schemes of Resistive Switching Memory (RRAM) for In-Memory Neural Network Circuits

A 3D Multi-Layer CMOS-RRAM Accelerator for Neural Network

Compensation Architecture to Alleviate Noise Effects in RRAM-based Computing-in-memory Chips with Residual Resource

A Low-Latency DNN Accelerator Enabled by DFT-Based Convolution Execution Within Crossbar Arrays

A compute-in-memory chip based on resistive random-access memory

AERIS: Area/Energy-Efficient 1T2R ReRAM Based Processing-in-Memory Neural Network System-on-a-Chip

Compensation Architecture Design Utilizing Residual Resource to Mitigate Impacts of Nonidealities in RRAM-based Computing-in-memory Chips

An On-chip Layer-wise Training Method for RRAM Based Computing-in-memory Chips.

Device and circuit optimization of RRAM for neuromorphic computing

NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators