Power Allocation Based on Deep Reinforcement Learning in HetNets with Varying User Activity.

Yao Chen, Hongtao Zhang
DOI: https://doi.org/10.1109/globecom42002.2020.9322126
2020-01-01
Abstract: Deep learning methods can reduce the computational complexity of power allocation algorithms, but when the number of users changes dynamically, the deep learning model must be retrained, which makes such methods impractical. In this paper, a power allocation scheme based on a deep Q-network (DQN) with padding is proposed to maximize the system's sum-rate under a dynamically changing number of users. Specifically, when a user is inactive, its state information cannot be obtained and the input matrix no longer matches the model's requirements; with padding, the missing part of the input matrix is filled with zeros, which allows the model to handle a dynamically changing number of users. In addition, base stations (BSs) can transmit information over multiple paths, which means a user can be served by several base stations at the same time. The results show that, in a scenario with 1 macrocell base station (MBS), 7 small-cell base stations (SBSs), and 20 users, the total system capacity increases by 18.4% compared with the WMMSE algorithm and by 50.0% compared with the max-RSS algorithm, and that the model properly handles an uncertain number of users.
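The zero-padding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `pad_state`, the per-user feature layout (one value per base station), and the constants are assumptions made for illustration, with the sizes taken from the reported scenario of 8 base stations and 20 users.

```python
import numpy as np

MAX_USERS = 20  # assumed fixed input height (20 users in the reported scenario)
NUM_BS = 8      # 1 MBS + 7 SBSs, as in the reported scenario


def pad_state(active_ids, features, max_users=MAX_USERS, num_bs=NUM_BS):
    """Build a fixed-size state matrix for a DQN-style model (hypothetical helper).

    active_ids: indices of the currently active users.
    features:   one row per active user, one column per base station
                (assumed layout, e.g. channel gains).
    Rows belonging to inactive users are left as zeros (the padding).
    """
    ids = np.asarray(active_ids, dtype=int)
    rows = np.asarray(features, dtype=float).reshape(len(ids), num_bs)
    state = np.zeros((max_users, num_bs), dtype=float)
    state[ids] = rows  # only active users contribute non-zero rows
    return state


# Two active users (0 and 3); the other 18 rows remain zero.
state = pad_state([0, 3], [[1.0] * NUM_BS, [2.0] * NUM_BS])
```

Because the output shape is always `(MAX_USERS, NUM_BS)`, the same trained network can be fed regardless of how many users are active, which is the property the abstract attributes to padding.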