Collaborative Intelligent Resource Trading for RAN Slicing: A Federated Policy Distillation Approach

Daniel Ayepah-Mensah,Guolin Sun
DOI: https://doi.org/10.1109/NCIC61838.2023.00018
2023-01-01
Abstract:In Radio Access Network (RAN), the sharing of resources can be modeled as a trading process in which multiple Mobile Virtual Network Operators (MVNOs) buy and sell resources according to their needs. This process can be competitive, with each MVNO strategically pricing and managing resources to maximize utility. Despite the dynamic nature of RAN slicing, deep-reinforcement learning (DRL) solutions often perform best but can be impractical due to their centralized nature and the need for full cooperation. These methods may face inconsistency due to MVNO heterogeneity, leading to imbalanced data distribution and potential selfishness, complicating optimal solution achievement. This paper proposes a collaborative intelligent framework for resource trading based on the Federated Deep Reinforcement Learning framework with Mutual Policy Distillation (FDRL-MPD), which enables MVNOs to collaborate and learn personalized trading models. Furthermore, we proposed a reward-shaping mechanism based on the proxy policy optimization (PPO) algorithm for local resource trading. Simulations performed with several MVNOs confirm the effectiveness of the proposed framework, especially concerning the algorithm's robustness to not independent and identically distributed data.
What problem does this paper attempt to address?