Abstract: Resistive Random Access Memory (ReRAM)-based Processing In Memory (PIM) accelerators have emerged as a promising computing architecture for memory-intensive applications such as Deep Neural Networks (DNNs). However, because the technology is immature, ReRAM devices often suffer from various reliability issues, which hinder the practicality of the PIM architecture and lead to severe degradation in DNN accuracy. Among these issues, device variation and offset current from High Resistance State (HRS) cells are considered major problems in a ReRAM-based PIM architecture. These problems reduce the throughput of ReRAM-based PIM because fewer wordlines are activated. In this paper, we propose VECOM, a novel approach that includes a variation-resilient encoding technique and an offset compensation scheme for a robust ReRAM-based PIM architecture. The first technique, VECOM encoding, builds on an analysis of the weight pattern distribution of DNN models, along with insight into the variation property of ReRAM. The second technique, VECOM offset compensation, tolerates offset current in PIM by mapping each Multi-Level Cell (MLC) level to a conductance that includes a specific offset conductance. Experimental results on various DNN models and datasets show that the proposed techniques can increase the throughput of the PIM architecture by up to 9.1 times while saving 50% of energy consumption without any software overhead. Additionally, VECOM is found to tolerate ReRAM cells with a low R ratio (up to 7) with a negligible accuracy drop.
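To make the offset-current problem concrete, the following minimal Python sketch models one crossbar bitline in which every MLC level carries a fixed offset conductance, and then removes the accumulated offset from the measured current. It is an illustration under simplifying assumptions (ideal cells, no device variation, binary wordline inputs), not the VECOM scheme itself, whose details the abstract does not give. The function names and the values of `g_offset`, `g_step`, and `v_read` are hypothetical.

```python
# Illustrative model only: names and numeric values are assumptions,
# not taken from the VECOM paper.

def level_to_conductance(level, g_offset, g_step):
    """Map an MLC level (0, 1, 2, ...) to a conductance that includes a fixed offset."""
    return g_offset + level * g_step


def bitline_current(inputs, levels, g_offset, g_step, v_read):
    """Sum the currents of all cells on one bitline.

    inputs: 0/1 per wordline (1 means the wordline is activated).
    levels: MLC level stored in the cell on each wordline.
    """
    return sum(
        x * v_read * level_to_conductance(w, g_offset, g_step)
        for x, w in zip(inputs, levels)
    )


def remove_offset(current, inputs, g_offset, g_step, v_read):
    """Subtract the offset contributed by every active cell, then rescale to level units."""
    active_wordlines = sum(inputs)
    offset_current = active_wordlines * v_read * g_offset
    return (current - offset_current) / (v_read * g_step)


def mac_with_compensation(inputs, levels, g_offset=1e-6, g_step=3e-6, v_read=0.2):
    """Recover the integer dot product of inputs and levels from the bitline current."""
    current = bitline_current(inputs, levels, g_offset, g_step, v_read)
    return round(remove_offset(current, inputs, g_offset, g_step, v_read))


if __name__ == "__main__":
    # Three of four wordlines are active; the expected dot product is 3 + 0 + 1.
    print(mac_with_compensation([1, 0, 1, 1], [3, 2, 0, 1]))
```

In this model the offset term grows with the number of activated wordlines, which is why an uncompensated offset limits how many wordlines can be read together.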

VECOM: Variation Resilient Encoding and Offset Compensation Schemes for Reliable ReRAM Based DNN Accelerator