Towards Energy-Efficient Collaborative Inference Using Multi-System Approximations

Arghadip Das, Soumendu Kumar Ghosh, Arnab Raha, Vijay Raghunathan
DOI: https://doi.org/10.1109/jiot.2024.3365306
IF: 10.6
2024-01-01
IEEE Internet of Things Journal
Abstract: Distributed deep neural networks (DDNNs) have shown considerable potential for collaborative inference applications. One use of DDNNs is the classification of 3D objects from a set of 2D images, or views, an approach known as multiview convolutional neural networks (MVCNN). However, intensive computational demands, substantial communication overhead, high inference latency, and limited energy budgets make it difficult to deploy MVCNN on resource-constrained edge devices. This paper proposes, for the first time, the concept of distributed approximate systems (DRAX), which takes a multi-device approach to approximate computing and applies synergistic approximations across multiple edge computing systems to enable energy-efficient collaborative DDNN inference. DRAX performs significance-aware approximation of multiple nodes and exploits the non-uniform contribution of the different views to the final inference to prune the large design space and achieve an optimal quality-energy trade-off. We also propose a novel remaining-energy-aware heuristic that dynamically chooses the degree of approximation based on user-provided quality bounds and further extends the system lifetime. Experimental results from a prototype 12-view 3D object classification system implemented on an Intel Stratix IV FPGA development board demonstrate substantial energy savings (2.6× to 8×) for minimal (< 1%) application-level quality loss.
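The abstract does not spell out the remaining-energy-aware heuristic, so the following is only an illustrative sketch of the general idea and not the paper's algorithm. It assumes a hypothetical table of approximation levels (LEVELS), a per-view significance weight, and a simple greedy rule that approximates more aggressively on nodes with little remaining energy while keeping the estimated quality loss within a user-provided bound. All names and numbers are invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    significance: float      # assumed weight of this view in the final inference
    remaining_energy: float  # assumed remaining energy budget (J), must be > 0


# Hypothetical approximation levels, from exact (index 0) to most aggressive:
# (energy per inference in J, quality loss in percentage points at significance 1.0)
LEVELS = [(1.0, 0.0), (0.6, 0.2), (0.35, 0.6), (0.2, 1.5)]


def estimated_loss(nodes, levels):
    """Estimated total quality loss for a given level assignment."""
    return sum(n.significance * LEVELS[levels[n.name]][1] for n in nodes)


def choose_levels(nodes, max_quality_loss):
    """Greedy sketch: start exact, then repeatedly raise the approximation
    level of the most 'urgent' node while the loss bound still holds."""
    levels = {n.name: 0 for n in nodes}
    total_loss = 0.0
    while True:
        best = None
        for n in nodes:
            cur = levels[n.name]
            if cur + 1 >= len(LEVELS):
                continue  # already at the most aggressive level
            extra_loss = n.significance * (LEVELS[cur + 1][1] - LEVELS[cur][1])
            if total_loss + extra_loss > max_quality_loss:
                continue  # this step would violate the user's quality bound
            saved = LEVELS[cur][0] - LEVELS[cur + 1][0]
            # Assumed urgency score: favour large energy savings, low remaining
            # energy, and low added quality loss.
            score = saved / (max(n.remaining_energy, 1e-9) * (extra_loss + 1e-9))
            if best is None or score > best[0]:
                best = (score, n, extra_loss)
        if best is None:
            return levels  # no further step fits within the bound
        _, node, extra_loss = best
        levels[node.name] += 1
        total_loss += extra_loss


nodes = [
    Node("view0", significance=0.5, remaining_energy=40.0),
    Node("view1", significance=0.1, remaining_energy=5.0),
    Node("view2", significance=0.2, remaining_energy=20.0),
]
print(choose_levels(nodes, max_quality_loss=0.3))
```

In this sketch a tighter quality bound leaves more nodes at the exact level, while a looser bound lets low-significance, low-energy nodes move to more aggressive levels first. A real system would derive the level table and significance weights from profiling rather than from fixed constants.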
computer science, information systems; telecommunications; engineering, electrical & electronic