Sum Rate Maximization in Multi-Cell Multi-User Networks: an Inverse Reinforcement Learning-Based Approach

Xingcong Tian,Ke Xiong,Ruichen Zhang,Pingyi Fan,Dusit Niyato,Khaled Ben Letaief
DOI: https://doi.org/10.1109/lwc.2023.3292280
IF: 6.3
2023-01-01
IEEE Wireless Communications Letters
Abstract:Currently, reinforcement learning (RL) is widely used in wireless network optimization, where the key factor is to design the efficient reward functions manually. However, the limitations of the man-made reward function are mainly that it is subject to human subjective factors and requires extensive simulations to select appropriate parameters. To cope with this, the letter proposes an inverse reinforcement learning (IRL)-based approach to optimize the transmit power control in multi-cell multi-user networks, which does not require any prior knowledge about the system and the wireless environment. An optimization problem is formulated to maximize the achievable sum information rate subject to the minimal required information rate of users. To tackle the NP-hard non-convex problem, our IRL-based approach is able to obtain rewards automatically rather than through a man-made reward function. In particular, the successive convex approximation (SCA)-based approach is adopted as the expert-policy to generate the expert data to train the automatic rewards. Simulation results support that the proposed IRL-based approach outperforms the traditional RL-based ones.
What problem does this paper attempt to address?