AMSPM: Adaptive Model Selection and Partition Mechanism for Edge Intelligence-driven 5G Smart City with Dynamic Computing Resources

Xin Niu,Xuejiao Cao,Chen Yu,Hai Jin
DOI: https://doi.org/10.1145/3652516
2024-03-16
ACM Transactions on Sensor Networks
Abstract:With the help of 5G network, edge intelligence (EI) can not only provide distributed, low-latency, and high-reliable intelligent services, but also enable intelligent maintenance and management of smart city. However, the constantly changing available computing resources of end devices and edge servers cannot continuously guarantee the performance of intelligent inference. In order to guarantee the sustainability of intelligent services in smart city, we propose the Adaptive Model Selection and Partition Mechanism (AMSPM) in 5G smart city where EI provides services, which mainly consists of Adaptive Model Selection (AMS) and Adaptive Model Partition (AMP). In AMSPM, the model selection and partition of deep neural network (DNN) are formulated as an optimization problem. Firstly, we propose a recursive-based algorithm named AMS based on the computing resources of edge devices to derive an appropriate DNN model that satisfies the latency demand of intelligent services. Then, we adaptively partition the selected DNN model according to the computing resources of edge devices. The experimental results demonstrate that, when compared with state-of-the-art model selection and partition mechanisms, AMSPM not only reduces latency but also enhances computing resource utilization.
computer science, information systems,telecommunications
What problem does this paper attempt to address?
This paper attempts to address the issue of continuously changing available computing resources in edge devices and edge servers in 5G smart cities, which leads to the inability to consistently guarantee the performance of intelligent services. Specifically, the paper focuses on how to select and dynamically partition the appropriate deep neural network (DNN) model in an Edge Intelligence (EI)-driven 5G smart city to meet the low-latency requirements of intelligent services and improve the utilization of computing resources. ### Main Issues 1. **Variation in Computing Resources**: The computing resources of edge devices and edge servers are dynamically changing, making traditional static model selection and partitioning methods ineffective. 2. **Low Latency Requirements**: Intelligent services need to operate with low latency, and high-complexity DNN models running on resource-limited edge devices may result in significant inference delays. 3. **Utilization of Computing Resources**: While meeting low-latency requirements, it is also necessary to maximize the utilization of computing resources to ensure the sustainability of intelligent services. ### Solution To address the above challenges, the paper proposes an Adaptive Model Selection and Partitioning Mechanism (AMSPM). The main components of AMSPM include: - **Adaptive Model Selection (AMS)**: Selecting the appropriate DNN model based on the current computing resources of the edge device to meet the latency requirements of intelligent services. - **Adaptive Model Partitioning (AMP)**: Dynamically partitioning the selected DNN model into two parts based on the computing resources of the edge device, with one part completed by the edge device and the other by the edge server, to further reduce inference delay. ### Experimental Results Experimental results show that compared to existing model selection and partitioning mechanisms, AMSPM can significantly reduce inference delay and improve the utilization of computing resources, thereby ensuring the sustainability and high quality of intelligent services. ### Conclusion By proposing AMSPM, the paper addresses the challenges brought by the dynamic changes in computing resources of edge devices and edge servers in 5G smart cities, achieving low latency and high resource utilization for intelligent services. This mechanism is of great significance for promoting the application of edge intelligence in 5G smart cities.