Efficient Partitioning and Communication Scheme-Based Distributed Edge Computing to Accelerate Deep Neural Network

Xudong Lu, Cheng Zhuo
DOI: https://doi.org/10.1109/cstic58779.2023.10219320
2023-01-01
Abstract: In the era of the Internet of Things (IoT), there is growing demand for deploying Deep Neural Networks (DNNs) on edge devices. However, edge devices typically have limited computing resources, which makes it challenging to handle the large computing workloads required for DNN inference. Several methods have been proposed to address this challenge, such as server-client pipelines and distributed computing, but workload partitioning and transmission schemes for edge devices have not been well discussed. This paper proposes a distributed edge computing system that leverages efficient partitioning algorithms and a TCP-based communication mechanism to accelerate DNN inference. The experimental results demonstrate that the proposed system with 4 nodes reduces the computation delay by 74.39%. Furthermore, when the number of nodes increases from 2 to 4, the inference is accelerated by 1.98-3.90×, which is 12.98% better than the prior MoDNN system in [6].
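The two headline figures in the abstract can be checked against each other with a little arithmetic. The sketch below assumes that the 74.39% delay reduction and the 3.90× speedup describe the same 4-node measurement, which the abstract suggests but does not state outright. The function names are illustrative and do not come from the paper.

```python
def speedup_from_reduction(reduction_pct: float) -> float:
    """Convert a percentage reduction in delay into a speedup factor.

    If the new delay is (1 - r) times the old delay, the speedup is 1 / (1 - r).
    """
    if not 0 <= reduction_pct < 100:
        raise ValueError("reduction must be in the range [0, 100)")
    return 1.0 / (1.0 - reduction_pct / 100.0)


def parallel_efficiency(speedup: float, nodes: int) -> float:
    """Fraction of ideal linear scaling achieved: speedup divided by node count."""
    return speedup / nodes


# Figures taken from the abstract.
four_node_speedup = speedup_from_reduction(74.39)
print(f"4-node speedup implied by a 74.39% reduction: {four_node_speedup:.2f}x")

# Efficiency at each end of the reported 1.98-3.90x range.
print(f"2-node efficiency: {parallel_efficiency(1.98, 2):.3f}")
print(f"4-node efficiency: {parallel_efficiency(3.90, 4):.3f}")
```

Under that assumption, a 74.39% reduction works out to roughly 3.90×, consistent with the upper end of the reported range, and both configurations stay close to linear scaling.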