Abstract:Device-to-device (D2D) technology, which allows direct communications between proximal devices, is widely acknowledged as a promising candidate to alleviate the mobile traffic explosion problem. In this paper, we consider an overlay D2D network, in which multiple D2D pairs coexist on several orthogonal spectrum bands, i.e., channels. Due to spectrum scarcity, the number of D2D pairs is typically more than that of available channels, and thus multiple D2D pairs may use a single channel simultaneously. This may lead to severe co-channel interference and degrade network performance. To deal with this issue, we formulate a joint channel selection and power control optimization problem, with the aim to maximize the weighted-sum-rate (WSR) of the D2D network. Unfortunately, this problem is non-convex and NP-hard. To solve this problem, we first adopt the state-of-art fractional programming (FP) technique and develop an FP-based algorithm to obtain a near-optimal solution. However, the FP-based algorithm requires instantaneous global channel state information (CSI) for centralized processing, resulting in poor scalability and prohibitively high signalling overheads. Therefore, we further propose a distributed deep reinforcement learning (DRL)-based scheme, with which D2D pairs can autonomously optimize channel selection and transmit power by only exploiting local information and outdated nonlocal information. Compared with the FP-based algorithm, the DRL-based scheme can achieve better scalability and reduce signalling overheads significantly. Simulation results demonstrate that even without instantaneous global CSI, the performance of the DRL-based scheme can approach closely to that of the FP-based algorithm.

Q- Learning Based Power Control Algorithm for D2d Communication

A Multi-agent Reinforcement Learning Based Power Control Algorithm for D2D Communication Underlaying Cellular Networks

Power Control Based on DRL Algorithm for D2D-Enabled Networks

A Q-learning Based Dynamic Power Control Algorithm for D2D Communication Underlaying Cellular Networks

A Reinforcement Learning Based Joint Spectrum Allocation and Power Control Algorithm for D2D Communication Underlaying Cellular Networks

Resource Allocation and Power Control Policy for Device-to-Device Communication Using Multi-Agent Reinforcement Learning

Multi-Agent Deep Reinforcement Learning-Based Power Control and Resource Allocation for D2D Communications

Power Control for D2D Communication Using Multi-Agent Reinforcement Learning

Energy-Efficient D2D Communications Based on Centralised Reinforcement Learning Techniques.

Deep Multi-Agent Reinforcement Learning for Resource Allocation in D2D Communication Underlaying Cellular Networks

Energy-Efficient Power Control and Resource Allocation Based on Deep Reinforcement Learning for D2D Communications in Cellular Networks

Deep Reinforcement Learning Based Power Allocation for D2D Network

Power Optimization in Device-to-Device Communications: A Deep Reinforcement Learning Approach with Dynamic Reward.

A Neural Network Based Power Allocation Algorithm for D2D Communication in Cellular Networks

Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches

Joint Mode Selection and Power Adaptation for D2D Communication with Reinforcement Learning.

D2D Communication Resource Allocation Algorithm Based on Multi-Agent Reinforcement Learning

Deep Reinforcement Learning for Joint Channel Selection and Power Control in D2D Networks

Double Deep Q-Network Based Distributed Resource Matching Algorithm for D2D Communication

Joint Deep Reinforcement Learning and Unsupervised Learning for Channel Selection and Power Control in D2D Networks

Energy-efficient Power Control Algorithm for D2D Communication