Deep Reinforcement Learning Based Big Data Resource Management for 5G/6G Communications

Zhaoyuan Shi,Xianzhong Xie,Sahil Garg,Huabing Lu,Helin Yang,Zehui Xiong
DOI: https://doi.org/10.1109/globecom46510.2021.9685098
2021-01-01
Abstract:With the advent of the Internet of Everything era, communication data has exploded, which requires more communication resources, such as frequency, time, and energy. In this context, this paper presents a machine learning-based data packet scheduling scheme to achieve efficient data packet transmission in the 5G/6G communication systems. To minimize the average number of packet overflows (APNO), we propose distributed deep deterministic policy gradient (DDPG)-based algorithm for multidimensional resource scheduling. To improve the algorithm stability and training efficiency, the strategy of centralized training and distributed execution is adopted, and an Action Adjuster is designed. The proposed algorithm enables the multidimensional resource management of the 5G/6G commu-nication systems without any information interaction between each agent. Simulation results show that the proposed Action Adjuster DDPG algorithm achieves faster convergence and less data overflow compared to other benchmark algorithms.
What problem does this paper attempt to address?