A Dynamic Execution Neural Network Processor for Fine-Grained Mixed-Precision Model Training Based on Online Quantization Sensitivity Analysis

Ruoyang Liu,Chenhan Wei,Yixiong Yang,Wenxun Wang,Binbin Yuan,Huazhong Yang,Yongpan Liu
DOI: https://doi.org/10.1109/jssc.2024.3377292
IF: 5.4
2024-01-01
IEEE Journal of Solid-State Circuits
Abstract:BSTcontrol As neural network (NN) training cost red has been growing exponentially over the past decade, developing high-speed and energy-efficient training methods has become an urgent task. Fine-grained mixed-precision low-bit training is the most promising way for high-efficiency training, but it needs dedicated processor designs to overcome the overhead in control, storage, and I/O and remove the power bottleneck in floating-point (FP) units. This article presents a dynamic execution NN processor supporting fine-grained mixed-precision training through an online quantization sensitivity analysis. Three key features are proposed: the quantization-sensitivity-aware dynamic execution controller, dynamic bit-width adaptive datapath design, and the low-power multi-level-aligned block-FP unit (BFPU). This chip achieves 13.2-TFLOPS/W energy efficiency and 1.07-TFLOPS/mm $^2$ area efficiency.
engineering, electrical & electronic
What problem does this paper attempt to address?