A Novel Non-Volatile Inverter-based CiM: Continuous Sign Weight Transition and Low Power on-Chip Training

Dong Zhang, Yuye Kang, Gan Liu, Zuopu Zhou, Kaizhen Han, Chen Sun, Leming Jiao, Xiaolin Wang, Yue Chen, Qiwen Kong, Zijie Zheng, Long Liu, Xiao Gong
DOI: https://doi.org/10.48550/arXiv.2209.09083
2022-09-19
Abstract: In this work, we report a novel design, one-transistor-one-inverter (1T1I), to satisfy the high-speed and low-power requirements of on-chip training. By leveraging doped HfO2 with ferroelectricity, a non-volatile inverter is successfully demonstrated, enabling the desired continuous weight transition between negative and positive via the programmable threshold voltage (VTH) of ferroelectric field-effect transistors (FeFETs). Compared with commonly used designs with a similar function, 1T1I uniquely achieves purely on-chip weight transition at an optimized working current without relying on assistance from off-chip calculation units for signed-weight comparison, facilitating high-speed training at low power consumption. Further improvements in linearity and training speed can be obtained via a two-transistor-one-inverter (2T1I) design. Overall, focusing on energy and time efficiencies, this work provides a valuable design strategy for future FeFET-based computing-in-memory (CiM).
Mesoscale and Nanoscale Physics
What problem does this paper attempt to address?
This paper addresses the high energy consumption and low efficiency of current computing architectures when handling artificial intelligence tasks. Specifically, the traditional von Neumann architecture wastes time and energy because data must be moved frequently between memory and processing units. To address this issue, the paper proposes a one-transistor-one-inverter (1T1I) design based on a non-volatile inverter, aiming to meet the requirements of high-speed, low-power on-chip training.

### Specific Problems and Solutions:

1. **Bottlenecks of Traditional Architectures**:
   - Traditional GPU designs are limited by the von Neumann architecture. Data is frequently transferred between processing units and memory, which costs a large amount of energy and time.
   - The energy consumption of memory access is two orders of magnitude higher than that of the computation itself, which seriously degrades the energy efficiency of the system.
2. **Limitations of Existing Compute-in-Memory (CiM) Schemes**:
   - Existing CiM technologies such as RRAM, Flash, and PCM suffer from high operating current and slow operation speed.
   - On-chip training schemes such as 2T2F (two transistors and two FeFETs) can improve computational accuracy, but they struggle to achieve a continuous weight transition and require additional off-chip computing resources, which increases energy consumption and latency.
3. **Proposed Solution**:
   - A new 1T1I structure is proposed. A non-volatile inverter is realized by using the ferroelectric properties of doped HfO₂.
   - By programming the threshold voltage (\( V_{TH} \)) of the ferroelectric field-effect transistor (FeFET), a continuous transition between negative and positive weights is achieved without relying on off-chip computing units for signed-weight comparison.
   - The design performs the weight transition purely on-chip at an optimized operating current, enabling high-speed, low-power training.

### Main Contributions:

- **High-Speed and Low-Power Training**: The 1T1I design enables efficient on-chip training at an optimized operating current, reducing energy consumption and increasing training speed.
- **Continuous Weight Transition**: Adjusting the threshold voltage of the FeFET gives a smooth transition from negative to positive weights, which addresses the difficulty of continuous weight updates in existing designs.
- **Experimental Verification**: The feasibility of the design is verified through simulation and experiments, and its advantages in practical applications are demonstrated.

In summary, this paper proposes a 1T1I design that aims to overcome the limitations of existing CiM schemes and to support future neural network hardware implementations.
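
The idea of a continuous signed-weight transition can be illustrated with a toy model. The sketch below is not the paper's device model: the tanh transfer curve, the threshold-voltage window, the zero-weight point, and all function names are hypothetical choices made only for illustration. It shows a single cell whose weight is a smooth, monotonic function of its programmed threshold voltage, so that a train of identical programming pulses moves the weight from negative through zero to positive without any separate sign comparison.

```python
import math

# All constants below are hypothetical and chosen only for illustration.
VTH_MIN, VTH_MAX = -1.0, 1.0   # assumed programmable V_TH window (volts)
VTH_MID = 0.0                  # assumed V_TH at which the weight is zero
SLOPE = 2.0                    # assumed steepness of the toy transfer curve


def weight_from_vth(vth):
    """Map a programmed V_TH to a signed weight in (-1, 1) (toy tanh model)."""
    return math.tanh(SLOPE * (vth - VTH_MID))


def program_pulse(vth, step):
    """Shift V_TH by one pulse, clipped to the programmable window."""
    return min(VTH_MAX, max(VTH_MIN, vth + step))


def sweep(vth_start, step, n_pulses):
    """Apply n identical pulses and record the weight after each one."""
    vth = vth_start
    weights = [weight_from_vth(vth)]
    for _ in range(n_pulses):
        vth = program_pulse(vth, step)
        weights.append(weight_from_vth(vth))
    return weights


if __name__ == "__main__":
    # Eight equal pulses sweep the cell across its whole V_TH window.
    for w in sweep(VTH_MIN, 0.25, 8):
        print(f"{w:+.3f}")
```

In this toy model the sign of the weight is never stored or compared separately; it follows from where the threshold voltage sits relative to the assumed midpoint, which is the property the summary attributes to the 1T1I cell.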