DNN Inference Task Offloading Based on Distributed Soft Actor-Critic in Mobile Edge Computing.

Wenxiu Xu, Ningjiang Chen, Huan Tu
DOI: https://doi.org/10.18293/seke2023-150
2023-01-01
Abstract: In mobile edge computing, DNN-driven intelligent inference services are highly sensitive to latency. Recently, collaborative inference between user devices and Edge Servers (ESs) based on DNN partitioning has been used to accelerate such services. However, because the computing resources of ESs are limited, concurrent requests compete for resources, so partitioned tasks cannot be offloaded to ESs in time. It is therefore necessary to design an efficient offloading scheme for partition-based concurrent inference tasks. Existing task offloading schemes based on Deep Reinforcement Learning (DRL) can solve complex decision-making problems in high-dimensional state spaces, but they suffer from insufficient sample diversity and tend to fall into local optima. We therefore propose a collaborative DNN inference task offloading scheme based on distributed Soft Actor-Critic (SAC). It lets SAC agents explore samples in parallel and share learning experiences, and it increases the randomness of the policy through the maximum entropy mechanism to avoid falling into local optima, thus achieving efficient offloading of concurrent partition tasks. Experimental results on DNN benchmarks show that, compared with the baseline schemes, our scheme reduces the average service latency by more than 18.3%, converges faster, achieves a higher task success rate, and helps ESs achieve load balancing.
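For context, the maximum entropy mechanism mentioned in the abstract is usually expressed in the standard SAC formulation as the objective below. This is the generic form of that objective, not necessarily the exact one used in the paper; the symbols (reward r, temperature α, entropy H, and state-action distribution ρ_π) follow the usual convention and are assumptions here.

```latex
J(\pi) = \sum_{t=0}^{T} \mathbb{E}_{(s_t, a_t) \sim \rho_\pi}
  \left[ r(s_t, a_t) + \alpha \, \mathcal{H}\big(\pi(\cdot \mid s_t)\big) \right],
\qquad
\mathcal{H}\big(\pi(\cdot \mid s_t)\big)
  = -\mathbb{E}_{a \sim \pi(\cdot \mid s_t)} \left[ \log \pi(a \mid s_t) \right]
```

A larger temperature α weights the entropy term more heavily, which keeps the policy more stochastic and encourages exploration; setting α = 0 recovers the ordinary expected-return objective.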