A Q-learning Based Dynamic Power Control Algorithm for D2D Communication Underlaying Cellular Networks

Shurui Jiang,Jun Zheng
DOI: https://doi.org/10.1109/wcsp52459.2021.9613167
2021-01-01
Abstract:This paper considers the dynamic power control problem in Device-to-Device (D2D) communication underlaying a cellular network. A resource allocation algorithm is proposed to dynamically perform spectrum and power allocation for cellular users (CUs) and D2D user (DU) pairs in the network. In particular, a Q-learning based dynamic power control (Q-DPC) algorithm is introduced to perform power allocation for DU pairs, which takes into account the minimum throughput requirement of CUs with an objective to maximize the overall system throughput of the network. The Q-DPC algorithm introduces Q-learning in power allocation, and aims to optimize the power allocation for all DU pairs sharing the same spectrum resources in the network at the arrival of a new DU pair, including the new arriving DU pair and all existing ones. Simulation results show that the proposed Q-DPC algorithm outperforms a random power allocation (RPA) algorithm in terms of the overall throughput of the network.
What problem does this paper attempt to address?