An Efficient and Reliable Negative Margin Timing Error Detection for Neural Network Accelerator Without Accuracy Loss in 28nm CMOS

Ziyu Li,Weiwei Shan,Chengjun Wu,Haitao Ge,Jun Yang
DOI: https://doi.org/10.1109/a-sscc53895.2021.9634809
2021-01-01
Abstract:Energy-efficient neural network (NN) accelerators are essential for IoT and mobile applications, where PVT variations become severe especially in near-threshold voltage (NTV) range. Recent work [1]–[4] applied error detection and correction (EDAC) based adaptive voltage frequency scaling (AVFS) on NN accelerators to eliminate the excess timing margins while decreasing power supply until detecting timing violations (Fig. 1). By using the fault tolerance of NN accelerators to avoid the error correction, they increased energy efficiency a lot. However, NN has limited tolerance to timing errors since a little timing errors will cause serious loss of accuracy, for example, up to 3% accuracy loss in MNIST [2]. Body swapping and adaptive clock techniques have also been adopted to reduce the accuracy loss [3– 4]. Traditional AVFS system monitors the most critical paths and then decreases the voltage until reaching point of first failure (PoFF). However, NN accelerator’s critical paths have distinct characteristics from conventional circuits that makes common EDAC not efficient and risky when applied in NN.
What problem does this paper attempt to address?