Distributed Deep Reinforcement Learning-Based Spectrum and Power Allocation for Heterogeneous Networks

Helin Yang,Jun Zhao,Kwok-Yan Lam,Zehui Xiong,Qingqing Wu,Liang Xiao
DOI: https://doi.org/10.1109/twc.2022.3153175
IF: 10.4
2022-01-01
IEEE Transactions on Wireless Communications
Abstract:This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimization problem with devices quality-of-service (QoS) constraints is non-convex and NP-hard, we model it as a Markov decision process (MDP). Considering the fact that the network is highly complex with large state and action spaces, a multi-agent dueling deep-Q network-based algorithm combined with distributed coordinated learning is proposed to effectively learn the optimized intelligent resource management policy, where the algorithm adopts dueling deep network to learn the action-value distribution by estimating both the state-value and action advantage functions. Under the distributed coordinated learning manner and dueling architecture, the learning algorithm can rapidly converge to the optimized policy. Simulation results demonstrate that the proposed distributed coordinated learning algorithm outperforms other existing learning algorithms in terms of learning efficiency, network data rate, and QoS satisfaction probability.
What problem does this paper attempt to address?