Buffer-Aware Wireless Scheduling Based On Deep Reinforcement Learning

Xu Chen, Wang Jian, Yu Tianhang, Kong Chuili, Huangfu Yourui, Li Rong, Ge Yiqun, Wang Jun
DOI: https://doi.org/10.1109/WCNC45663.2020.9120729
2020-01-01
Abstract: This paper models the downlink packet scheduling problem for cellular networks as a joint optimization of throughput, fairness, and packet drop rate. Two genie-aided heuristic search methods are employed to explore the solution space. A deep reinforcement learning (DRL) framework based on the advantage actor-critic (A2C) algorithm is proposed for the optimization problem. Several methods are used in the framework to improve sampling and training efficiency and to adapt the algorithm to the specific scheduling problem. Numerical results show that DRL outperforms the baseline algorithm and achieves performance similar to that of the genie-aided methods without using future information.
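
For readers unfamiliar with A2C, the quantity that gives the algorithm its name can be sketched in a few lines. This is a generic, minimal illustration of the advantage estimate (discounted return minus the critic's value estimate) and is not the authors' implementation. The function names, the discount factor, and the numeric rewards are hypothetical placeholders; the abstract does not specify how throughput, fairness, and packet drop rate are combined into a reward.

```python
from typing import List


def discounted_returns(rewards: List[float], bootstrap_value: float,
                       gamma: float = 0.99) -> List[float]:
    """Discounted return for each step of a rollout.

    bootstrap_value is the critic's estimate for the state that follows
    the last step (use 0.0 if the rollout ended the episode).
    """
    returns = []
    running = bootstrap_value
    # Walk backwards so each return reuses the one that follows it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(rewards: List[float], values: List[float],
               bootstrap_value: float, gamma: float = 0.99) -> List[float]:
    """Advantage estimate: discounted return minus critic value per step."""
    returns = discounted_returns(rewards, bootstrap_value, gamma)
    return [g - v for g, v in zip(returns, values)]


# Hypothetical per-step scheduling rewards and critic values (placeholders).
example_rewards = [1.0, 0.0, 2.0]
example_values = [0.5, 0.5, 0.5]
example_adv = advantages(example_rewards, example_values,
                         bootstrap_value=0.0, gamma=0.5)
```

In an actor-critic setup of this kind, a positive advantage raises the probability of the scheduling action that was taken and a negative one lowers it, while the critic is trained to predict the returns.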