Handover Optimization Via Asynchronous Multi-User Deep Reinforcement Learning

Zhi Wang,Lihua Li,Yue Xu,Hui Tian,Shuguang Cui
DOI: https://doi.org/10.1109/ICC.2018.8422824
2018-01-01
Abstract:In this paper, an asynchronous multi-user deep reinforcement learning scheme is developed to control the handover (HO) processes across multiple user equipments (UEs), in the goal of lowering the HO rate while ensuring certain system throughput. In this scheme, we use a deep neural network (DNN) as an HO controller learned by each UE via reinforcement learning in a collaborative fashion. Moreover, we use supervised learning in initializing the DNN controller before the execution of reinforcement learning to exploit what we already know with traditional HO schemes and to mitigate the negative effects of random exploration at the initial stage. Furthermore, we show that the adopted global-parameter-based framework enables us to train faster with more UEs, which could nicely address the scalability issue to support large systems. Finally, simulation results demonstrate that the proposed framework can achieve better performance than the state-of-art on-line schemes, in terms of HO rates.
What problem does this paper attempt to address?