Shavette: Low Power Neural Network Acceleration via Algorithm-level Error Detection and Undervolting

Mikael Rinkinen, Lauri Koskinen, Olli Silven, Mehdi Safarpour
2024-10-17
Abstract: Reduced voltage operation is an effective technique for substantial energy efficiency improvement in digital circuits. This brief introduces a simple approach for enabling reduced voltage operation of Deep Neural Network (DNN) accelerators through software modifications alone. Conventional approaches for enabling reduced voltage operation, e.g., Timing Error Detection (TED) systems, incur significant development costs and overheads, and are not applicable to off-the-shelf components. In contrast, the solution proposed in this paper relies on algorithm-based error detection; hence it can be implemented with low development costs, does not require any circuit modifications, and is even applicable to commodity devices. By showcasing the solution through experiments on popular DNNs, i.e., LeNet and VGG16, on a GPU platform, we demonstrate 18% to 25% energy savings with no accuracy loss of the models and negligible throughput compromise (< 3.9%), considering the overheads from integrating the error detection schemes into the DNN. Integrating the presented algorithmic solution into a design is simpler than with conventional TED-based techniques, which require extensive circuit-level modifications, cell library characterizations, or special support from the design tools.
Hardware Architecture, Artificial Intelligence
What problem does this paper attempt to address?
The main problem this paper attempts to solve is: **how to achieve low-power operation of deep neural network (DNN) accelerators through software-level error detection and undervolting, improving energy efficiency while maintaining model accuracy and throughput**. Specifically, the paper proposes an algorithm-based error detection method (Algorithm-Based Fault Tolerance, ABFT) and evaluates it experimentally on popular DNN models such as LeNet and VGG16 on a GPU platform. The method allows the voltage to be reduced safely without modifying the hardware circuit, saving 18% to 25% of energy with no loss of model accuracy and only a small throughput reduction (<3.9%).

### Summary of main problems:

1. **Reduce power consumption**: Reduce the energy consumption of DNN accelerators through undervolting.
2. **Ensure reliability**: Use ABFT for error detection to ensure that computations remain correct at low voltages.
3. **Applicability**: The method applies to off-the-shelf commercial devices and requires neither complex hardware modifications nor support from special design tools.

### Key technical points:

- **Undervolting operation**: Lower the operating voltage to reduce power consumption.
- **ABFT**: Ensure computational correctness at low voltages through an error detection mechanism at the algorithm level.
- **Dynamic voltage and frequency scaling (DVFS)**: Dynamically adjust the voltage and frequency according to the error detection results.

Through these techniques, the paper presents an efficient, low-cost, and easy-to-implement method that can significantly improve the energy efficiency of DNN accelerators without sacrificing performance.
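
To make the idea of algorithm-level error detection concrete, the following is a minimal sketch of the classic checksum scheme for matrix multiplication, the operation that dominates DNN inference. It is an illustration under stated assumptions, not the paper's implementation: the function names and the `fault` hook (which imitates an undervolting-induced error) are hypothetical, and integer matrices are used so that the checksum comparison is exact. With floating-point data the comparison would need a tolerance.

```python
def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def abft_matmul(a, b, fault=None):
    """Compute a @ b and check it against a column checksum.

    `fault` is a hypothetical test hook: a tuple (row, col, delta) that
    corrupts one output element to imitate a timing error.
    Returns (c, ok), where ok is False if the checksums disagree.
    """
    c = matmul(a, b)
    if fault is not None:
        row, col, delta = fault
        c[row][col] += delta

    # Checksum predicted from the inputs: (column sums of a) @ b.
    col_sums_a = [sum(col) for col in zip(*a)]
    expected = matmul([col_sums_a], b)[0]

    # Checksum measured on the output: column sums of c.
    actual = [sum(col) for col in zip(*c)]

    return c, expected == actual


a = [[1, 2], [3, 4]]
b = [[5, 6], [7, 8]]

c, ok = abft_matmul(a, b)
print(c, ok)    # [[19, 22], [43, 50]] True

_, ok_faulty = abft_matmul(a, b, fault=(0, 1, 4))
print(ok_faulty)  # False
```

The check costs one extra vector-matrix product and a column sum, which is small next to the full product for large matrices. In an undervolting setup, a failed check would be the signal to recompute the layer or raise the voltage again; how that feedback is wired to the voltage controller is platform-specific and not shown here.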