iPAS: A deep Monte Carlo Tree Search-based intelligent pilot-power allocation scheme for massive MIMO system

Jienan Chen,Siyu Luo,Lin Zhang,Cong Zhang,Bin Cao
DOI: https://doi.org/10.1016/j.dcan.2020.07.009
IF: 6.348
2021-08-01
Digital Communications and Networks
Abstract:Massive Multiple-Input-Multiple-Output (MIMO) is a promising technology to meet the demand for connection of massive devices and high data capacity for mobile networks in the next generation communication system. However, due to the massive connectivity of mobile devices, the pilot contamination problem will severely degrade the communication quality and spectrum efficiency of the massive MIMO system. To address this issue, we propose a deep Monte Carlo Tree Search (MCTS) based intelligent Pilot-Power Allocation Scheme (iPAS). The core of iPAS is a multi-task deep reinforcement learning algorithm that can automatically learn the radio environment and make decisions on the pilot sequence and power allocation to maximize the spectrum efficiency with self-play training. To accelerate the searching convergence, we introduce a Deep Neural Network (DNN) to predict the pilot sequence and power allocation actions. DNN is trained in a self-supervised learning manner, where the training data is generated from the searching process of the MCTS algorithm. Numerical results show that our proposed iPAS achieves a better cumulative distribution function (CDF) of ergodic spectral efficiency compared to the previous suboptimal algorithm.
telecommunications
What problem does this paper attempt to address?