A 16kb Transpose 6T SRAM In-Memory-Computing Macro Based on Robust Charge-Domain Computing

Jiahao Song,Yuan Wang,Xiyuan Tang,Runsheng Wang,Ru Huang
DOI: https://doi.org/10.1109/a-sscc53895.2021.9634747
2021-01-01
Abstract:Intelligent edge devices are vital in the IoT era when massive data needs to be processed locally for privacy protections. The intelligence always comes with extensive computation and data movement power. In-memory computing (IMC) [1]–[6] has been regarded as a promising route to enable energy-efficient computing in a power constraint platform. Besides, re-training or fine-tuning a part of the neural networks is an effective method to adapt case-by-case variations in edge and protect users’ privacies. Unlike inference, the training phase needs to access the transpose of the weight matrices, as shown in Fig. 1. Very few works have been reported on transpose IMC macros [3], where the current-domain feed-forward (FF) and back-propagation (BP) IMC are supported, and a small offset sense amplifier (SA) is proposed to improve the sensing margin. However, the current-domain computing always suffers from the transistor-variation issue [7]. To mitigate transistor-variation-induced errors and improve the robustness of transpose IMC macros, we present a charge-domain transposable IMC architecture, which supports both the in-memory FF and the BP multiplication-and-accumulation (MAC) computing in the same memory. 6T bit-cells combined with a local charge-based computing unit (CCU) are adopted to improve storage density, where both the FF and the BP computing are realized in the charge domain. Besides, the macro can be reconfigured to execute XNOR or AND, offering great flexibilities to cover binary neural networks (NNs) and multi-bit operations through serially 1-bit operations.
What problem does this paper attempt to address?