Distributed Age-of-Information Scheduling with NOMA Via Deep Reinforcement Learning

Congwei Zhang, Yifei Zou, Zuyuan Zhang, Dongxiao Yu, Jorge Torres Gómez, Tian Lan, Falko Dressler, Xiuzhen Cheng
DOI: https://doi.org/10.1109/tmc.2024.3459101
IF: 6.075
2024-01-01
IEEE Transactions on Mobile Computing
Abstract: Many emerging applications in edge computing require processing huge volumes of data generated by end devices, using the freshest available information. In this paper, we address the distributed optimization of multi-user long-term average Age-of-Information (AoI) objectives in edge networks that use NOMA transmission. This is a non-convex online optimization problem, which in existing work often requires either decision making in a combinatorial space or a global view of the entire network state. To overcome this challenge, we propose a reinforcement learning-based framework that adopts a novel hierarchical decomposition of decision making. Specifically, we propose three types of distributed agents: one that learns with respect to the efficiency of AoI scheduling, one that learns with respect to the fairness of AoI scheduling, and one that learns a high-level policy balancing these potentially conflicting design objectives. The proposed decomposition improves learning performance by disentangling the different design objectives and rewards. It also enables the algorithm to learn explanations alongside the best policy, since actions can be directly compared in terms of the design objectives. Our evaluations show that the proposed algorithm improves the long-term average AoI by 200%-300% compared to prior works with NOMA and by 400% compared to the optimal solution without NOMA.
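To make the long-term average AoI objective concrete, here is a minimal Python sketch under a common slotted model that the abstract itself does not spell out. It assumes that a user's age resets to 1 in the slot after a fresh update is delivered and otherwise grows by 1 per slot. The function names `average_aoi` and `network_average_aoi` are hypothetical, and the assumption that every scheduled transmission is decoded successfully is a simplification, not the paper's channel model.

```python
def average_aoi(delivered, initial_age=1):
    """Time-average AoI of one user (assumed slotted model, not the paper's).

    delivered[t] is True if a fresh update from the user is received in slot t.
    """
    age = initial_age
    total = 0
    for ok in delivered:
        total += age                  # age observed at the start of slot t
        age = 1 if ok else age + 1    # reset on delivery, otherwise grow by one
    return total / len(delivered)


def network_average_aoi(schedule):
    """Mean of the per-user time-average AoI; schedule is one list per user."""
    return sum(average_aoi(user) for user in schedule) / len(schedule)


# Two users over four slots.
# Orthogonal access (assumed): only one user is served per slot.
oma = [[True, False, True, False],
       [False, True, False, True]]
# NOMA (idealized assumption): both users are decoded in every slot.
noma = [[True, True, True, True],
        [True, True, True, True]]

print(network_average_aoi(oma))   # 1.375
print(network_average_aoi(noma))  # 1.0
```

In this toy setting, serving both users in the same slot lowers the average age, which is the intuition behind combining NOMA with AoI scheduling. The actual gain depends on power allocation and decoding success, which this sketch ignores.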