Joint Beamforming and Power Control for MIMO-NOMA with Deep Reinforcement Learning

Tongwei Lu, Haijun Zhang, Keping Long
DOI: https://doi.org/10.1109/ICC42927.2021.9500713
2021-01-01
Abstract: Reinforcement learning (RL) is widely applied to resource management in wireless communication networks. However, many optimization problems have high computational complexity, and traditional RL fails to solve continuous, high-dimensional problems. This paper investigates the sum-rate problem in a single-cell multiuser multiple-input multiple-output (MIMO) non-orthogonal multiple access (NOMA) network. In our scenario, users are separated into two groups: while guaranteeing a minimum target rate for the users in one group, we maximize the sum rate of the users in the other group. To tackle this non-convex optimization problem, we design a joint beamforming and power control algorithm based on deep reinforcement learning (DRL), using a deep Q-network (DQN) and double DQN. Simulation results verify the convergence and feasibility of the proposed algorithm, which achieves significant sum-rate gains.
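The abstract names DQN and double DQN without saying how they differ, so a minimal sketch of the two standard learning targets may help. It is not the authors' implementation: the function names, the discount factor, and the three-action Q-value arrays are hypothetical, and the reading of each action as a discrete (beamforming, power) choice is an assumption made only for illustration.

```python
import numpy as np


def dqn_target(reward, q_next_target, gamma=0.9, done=False):
    """Standard DQN target: the target network both selects and
    evaluates the best next action (max over its own Q-values)."""
    if done:
        return float(reward)
    return float(reward + gamma * np.max(q_next_target))


def double_dqn_target(reward, q_next_online, q_next_target, gamma=0.9, done=False):
    """Double DQN target: the online network selects the next action,
    and the target network evaluates that action."""
    if done:
        return float(reward)
    best_action = int(np.argmax(q_next_online))
    return float(reward + gamma * q_next_target[best_action])


# Hypothetical next-state Q-values over three discrete actions
# (assumed here to index joint beamforming / power-level choices).
q_online = np.array([1.0, 2.0, 0.5])
q_target = np.array([1.5, 1.0, 3.0])

y_dqn = dqn_target(1.0, q_target)                   # uses max of q_target
y_ddqn = double_dqn_target(1.0, q_online, q_target)  # uses q_target[argmax q_online]
```

Decoupling action selection from action evaluation is the usual motivation for double DQN, since taking the maximum over a single noisy estimate tends to overestimate action values.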