Transposable Memory Based on the Ferroelectric Field-Effect Transistor

Jianze Wang,Wei Zhang,Zhen Wu,Yimin Wang,Leming Jiao,Xiaolin Wang,Xiao Gong,Xuanyao Fong
DOI: https://doi.org/10.1109/iscas58744.2024.10558599
2024-01-01
Abstract:Non-volatile ferroelectric field-effect transistor (FeFET) technology is a promising CMOS process compatible solution for fast, energy efficient on-chip memories that are needed to enable ubiquitous deployment of AI. Existing memories are linear by design and the bandwidth may not be sufficient to support transformers that underlie the now-popular Large Language Models (LLMs) for generative AI, which require fast accesses to matrices and their transposes. In this work, two designs of transposable FeFET memories are proposed to address this challenge. We evaluate our designs using compact models for the FeFET devices, which we have calibrated to experimentally measured device characterization data. We demonstrate that 1 ns read latencies are possible, which underscores the potential of FeFET technology for AI accelerators. The area efficiency of our proposed dual-gated transposable memory and the speed of transpose operations are studied, showing that it occupies 30.02 F-2 cell area and is capable of maximum 82.24x faster transpose operation compared to non-transposable memory.
What problem does this paper attempt to address?