Optimizing Mobility-Aware Task Offloading in Smart Healthcare for Internet of Medical Things Through Multi-Agent Reinforcement Learning

Chongwu Dong,Yanbin Sun,Muhammad Shafiq,Ning Hu,Yuan Liu,Zhihong Tian
DOI: https://doi.org/10.1109/jiot.2023.3338718
IF: 10.6
2024-01-01
IEEE Internet of Things Journal
Abstract:In the scenario of smart healthcare applications, the Internet of Medical Things (IoMT) devices, equipped with limited resources, would offload numerous computation-heavy tasks to an edge server through 5G networks. However, IoMT devices should usually move around different diagnostic areas in smart healthcare systems, leading to the dynamics of the uplink channel quality. Moreover, the burst generation of a substantial number of tasks from IoMT devices can result in congestion within the computing queue of the edge server. And, heterogeneous services in IoMT devices make it hard to collect global information for a central controller to get the optimal optimization for all IoMT devices. So, how to determine task offloading among IoMT devices in a distributed scenario of smart healthcare applications should be considered appropriately and comprehensively. In this paper, we investigate task offloading in mobile edge computing (MEC) through wireless networks. To improve the utilization of wireless resources, non-orthogonal multiple access (NOMA) is adopted in 5G networks. We first formulate the mobility of IoMT devices as a Hidden Markov Model (HMM) and the problem of task offloading policy as a distributed Partial Markov Decision Process (Dec-POMDP). Then, we propose a mobility-aware method based on Multi-agent reinforcement learning for task offloading in 5G NOMA-enabled networks. In our approach, task offloading scheduling for each IoMT device in NOMA-enabled 5G networks is considered to improve energy efficiency and guarantee service quality. Besides, the time complexity and the existence of a Nash equilibrium for our proposed Dec-POMDP method are theoretically derived. Simulations are conducted to show that our algorithm outperforms other alternative methods in energy consumption under the delay constraint.
What problem does this paper attempt to address?