Inverse Reinforcement Learning Meets Power Allocation in Multi-user Cellular Networks

Ruichen Zhang, Ke Xiong, Xingcong Tian, Yang Lu, Pingyi Fan, Khaled Ben Letaief
DOI: https://doi.org/10.1109/infocomwkshps54753.2022.9798257
2022-01-01
Abstract: This paper proposes an inverse reinforcement learning (IRL)-based method to optimize power allocation in multiuser cellular networks. An optimization problem is formulated to maximize the achievable sum information rate of all receivers. In contrast to traditional reinforcement learning (RL)-based methods, the proposed IRL-based method does not require the reward function to be designed manually; instead, it determines the reward function efficiently and automatically from an expert policy. The weighted minimum mean square error (WMMSE) method serves as the expert policy from which the reward function is obtained, and the action space and state space are designed. Simulation results show that the proposed IRL-based method achieves about 99% of the sum information rate achieved by the pure WMMSE method, while its running time is about 1/19 of that required by the pure WMMSE method.
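To make the objective concrete, the sketch below evaluates a sum information rate for a given power allocation. The abstract does not state the system model, so this assumes a standard single-antenna interference channel in which each receiver treats the other transmitters' signals as noise. The function name `sum_rate`, the gain matrix layout, and the two-user numbers are all hypothetical and are not taken from the paper.

```python
import math


def sum_rate(gains, powers, noise_power=1.0):
    """Return sum_k log2(1 + SINR_k) in bit/s/Hz.

    Assumed model (not from the paper): gains[k][j] is the channel power
    gain from transmitter j to receiver k, and powers[j] is the transmit
    power of transmitter j. Receiver k is served by transmitter k.
    """
    total = 0.0
    for k, row in enumerate(gains):
        signal = row[k] * powers[k]
        # Power received from every other transmitter counts as interference.
        interference = sum(
            row[j] * powers[j] for j in range(len(powers)) if j != k
        )
        total += math.log2(1.0 + signal / (interference + noise_power))
    return total


# Hypothetical two-user channel with weak cross links.
gains = [[1.0, 0.1],
         [0.2, 1.0]]
both_on = sum_rate(gains, [1.0, 1.0])   # both transmitters at unit power
one_on = sum_rate(gains, [1.0, 0.0])    # second transmitter silent
```

Under this assumed model, a power allocation method such as WMMSE or a learned policy would search over `powers` to make the returned value as large as possible, subject to per-transmitter power limits.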