Abstract: Device-to-device (D2D) technology, which allows direct communication between proximal devices, is widely acknowledged as a promising candidate to alleviate the mobile traffic explosion problem. In this paper, we consider an overlay D2D network, in which multiple D2D pairs coexist on several orthogonal spectrum bands, i.e., channels. Due to spectrum scarcity, the number of D2D pairs typically exceeds the number of available channels, and thus multiple D2D pairs may use a single channel simultaneously. This may lead to severe co-channel interference and degrade network performance. To deal with this issue, we formulate a joint channel selection and power control optimization problem, with the aim of maximizing the weighted-sum-rate (WSR) of the D2D network. Unfortunately, this problem is non-convex and NP-hard. To solve it, we first adopt the state-of-the-art fractional programming (FP) technique and develop an FP-based algorithm to obtain a near-optimal solution. However, the FP-based algorithm requires instantaneous global channel state information (CSI) for centralized processing, resulting in poor scalability and prohibitively high signalling overheads. Therefore, we further propose a distributed deep reinforcement learning (DRL)-based scheme, with which D2D pairs can autonomously optimize channel selection and transmit power by exploiting only local information and outdated nonlocal information. Compared with the FP-based algorithm, the DRL-based scheme achieves better scalability and significantly reduces signalling overheads. Simulation results demonstrate that, even without instantaneous global CSI, the performance of the DRL-based scheme can closely approach that of the FP-based algorithm.
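To make the objective concrete, the following is a minimal sketch of how a weighted sum rate could be evaluated for a given channel selection and power setting. The abstract does not state the rate expression, so the Shannon-rate form log2(1 + SINR), the function name `weighted_sum_rate`, the `gains[c][j][i]` layout, and all numeric values are illustrative assumptions, not the paper's model.

```python
import math

def weighted_sum_rate(gains, channels, powers, weights, noise=1.0):
    """Hypothetical WSR evaluation (assumed Shannon-rate model).

    gains[c][j][i]: assumed gain from transmitter j to receiver i on channel c
    channels[i]:    channel index selected by D2D pair i
    powers[i]:      transmit power of pair i
    weights[i]:     rate weight of pair i
    """
    n = len(powers)
    total = 0.0
    for i in range(n):
        c = channels[i]
        signal = gains[c][i][i] * powers[i]
        # Only pairs that picked the same channel cause co-channel interference.
        interference = sum(
            gains[c][j][i] * powers[j]
            for j in range(n)
            if j != i and channels[j] == c
        )
        sinr = signal / (noise + interference)
        total += weights[i] * math.log2(1.0 + sinr)
    return total

# Toy setup (assumed values): 3 D2D pairs sharing 2 channels,
# direct gain 1.0 and cross gain 0.5 on every channel.
n_pairs, n_channels = 3, 2
gains = [
    [[1.0 if i == j else 0.5 for i in range(n_pairs)] for j in range(n_pairs)]
    for _ in range(n_channels)
]
powers = [1.0, 1.0, 1.0]
weights = [1.0, 1.0, 1.0]

wsr_shared = weighted_sum_rate(gains, [0, 0, 0], powers, weights)
wsr_split = weighted_sum_rate(gains, [0, 0, 1], powers, weights)
```

In this toy setup, moving one pair to the second channel removes part of the co-channel interference, so `wsr_split` is larger than `wsr_shared`. A joint optimizer would search over both the `channels` and `powers` arguments; that search is the part the abstract describes as non-convex and NP-hard.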

Inverse Reinforcement Learning Meets Power Allocation in Multi-user Cellular Networks

Sum Rate Maximization in Multi-Cell Multi-User Networks: an Inverse Reinforcement Learning-Based Approach

Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches

IRL-PM: An Inverse Reinforcement Learning-based Power Minimization in Multi-User MISO Networks

Delay-Aware Power Control for Downlink Multi-User MIMO Via Constrained Deep Reinforcement Learning

Joint BS-User Association, Power Allocation, and User-Side Interference Cancellation in Cell-free Heterogeneous Networks

Multi-Agent Reinforcement Learning Based Unlicensed Resource Sharing for LTE-U Networks

A Deep Reinforcement Learning-Based Technique for Optimal Power Allocation in Multiple Access Communications

BiLSTM Based Reinforcement Learning for Resource Allocation and User Association in LTE-U Networks

A Dynamic Power Allocation Scheme in Power-Domain NOMA Using Actor-Critic Reinforcement Learning

Deep Reinforcement Learning for Joint Spectrum and Power Allocation in Cellular Networks

Multi-objective Bandwidth and Power Allocation for Energy-Efficient Uplink Communications

Downlink Power Control for Cell-Free Massive MIMO with Deep Reinforcement Learning

Tradeoff Between Network Energy Consumption and Terminal Energy Consumption Via Small Cell Power Control

Resource allocation in multi-user cellular networks: A transformer-based deep reinforcement learning approach

Reinforcement Learning Enhanced Iterative Power Allocation in Stochastic Cognitive Wireless Mesh Networks

Cellular Network Power Allocation Algorithm Based on Deep Reinforcement Learning and Artificial Intelligence

Deep Reinforcement Learning-Based Power Allocation for Minimizing Age of Information and Energy Consumption in Multi-Input Multi-Output and Non-Orthogonal Multiple Access Internet of Things Systems

A Multi-agent Reinforcement Learning Based Power Control Algorithm for D2D Communication Underlaying Cellular Networks

Joint mode switching and resource allocation in wireless-powered RIS-aided multiuser communication systems

Deep Reinforcement Learning for Joint Channel Selection and Power Control in D2D Networks