Scheduling Real-Time Wireless Traffic: A Network-Aided Offline Reinforcement Learning Approach

Jialin Wan,Sen Lin,Zhaofeng Zhang,Junshan Zhang,Tao Zhang
DOI: https://doi.org/10.1109/jiot.2023.3304969
IF: 10.6
2023-01-01
IEEE Internet of Things Journal
Abstract:Real-time traffic has stringent requirements in terms of latency, and deadline guarantees on packet delivery play a vital role in real-time IoT applications. Deadline-aware wireless scheduling of real-time traffic has been a long-standing open problem, despite significant efforts using analytical methods. Departing from the conventional approaches, this work studies deadline-aware traffic scheduling by taking an offline reinforcement learning (RL) approach to train scheduling algorithms, ready to be used for online scheduling. To address the challenges therein, we propose a network-aided offline RL (NA-ORL) framework for deadline-aware scheduling, by making use of the fact that the network dynamics follows a well-defined physics model. Specifically, in NA-ORL the initialization of the scheduling policy is obtained through behavior cloning with a good model-based scheduling algorithm, and the network-aided actor–critic (A–C) method is utilized to train a better scheduling policy with carefully designed states and reward function, thanks to its nature of policy improvement. Building on NA-ORL, we further devise a network-aided offline meta-RL (NA-MRL) algorithm to deal with the nonstationary network dynamics. Extensive experimental results demonstrate that the proposed NA-ORL and NA-MRL algorithms can achieve better performance over adaptive mixing over nondominated links (AMIX-ND) and largest-deficit-first (LDF), in various scenarios for the deadline-aware wireless scheduling.
What problem does this paper attempt to address?