Abstract:Device-to-device (D2D) technology enables direct communication between adjacent devices within cellular networks. Due to its high data rate, low latency, and performance improvement in spectrum and energy efficiency, it has been widely investigated and applied as a critical technology in 5G New Radio (NR). In addition to conventional overlay and underlay D2D communications, cooperative D2D communication, which can achieve a win-win situation between cellular users (CUs) and D2D users (DUs) through cooperative relaying technique, has attracted extensive attention from academic and industrial circles in the past decade. This paper delves into optimizing joint spectrum allocation, power control, and link-matching between multiple CUs and DUs for cooperative D2D communications, using weighted sum energy efficiency (WSEE) as the performance metric to address the challenges of green communication and sustainable development. This integer programming problem can be decomposed into a classic weighted bipartite graph matching and a series of nonconvex spectrum allocation and power control problems between potentially matched cellular and D2D link pairs. To address this issue, we propose a hybrid centralized-distributed scheme based on deep reinforcement learning (DRL) and the Kuhn-Munkres (KM) algorithm. Leveraging the latter, the CUs and DUs autonomously optimize spectrum allocation and power control by only utilizing local information. Then, the base station (BS) determines the link matching. Simulation results reveal that it achieves near-optimal performance and significantly enhances the network convergence speed with low signaling overheads. In addition, we also propose and utilize cooperative link sets for corresponding D2D links to accelerate the proposed scheme and reduce signaling exchange further.

Distributed Deep Reinforcement Learning-Based Spectrum and Power Allocation for Heterogeneous Networks

Deep Reinforcement Learning Based Resource Allocation for Heterogeneous Networks

Deep Reinforcement Learning Based Massive Access Management for Ultra-Reliable Low-Latency Communications

Distributed Two-tier DRL Framework for Cell-Free Network: Association, Beamforming and Power Allocation.

Hybrid Centralized-Distributed Resource Allocation Based on Deep Reinforcement Learning for Cooperative D2D Communications

Multi-agent Reinforcement Learning Based Distributed Dynamic Spectrum Access

Deep-Reinforcement-Learning-Based Resource Allocation in ultra-dense network.

Deep Reinforcement Learning for Joint Spectrum and Power Allocation in Cellular Networks

Deep Reinforcement Learning for Distributed Dynamic Power Allocation in Wireless Networks.

Deep Reinforcement Learning for User Association and Resource Allocation in Heterogeneous Cellular Networks

Multi-Agent Deep Reinforcement Learning for Resource Allocation in the Multi-Objective HetNet

Deep Reinforce Learning and Meta-Learning Based Resource Allocation in Cellular Heterogeneous Networks

Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications.

Joint Optimization of Handover Control and Power Allocation Based on Multi-Agent Deep Reinforcement Learning

Deep Reinforcement Learning for Multi-Agent Power Control in Heterogeneous Networks

Joint Deep Reinforcement Learning and Unsupervised Learning for Channel Selection and Power Control in D2D Networks

Distributed Multi-Cell Power Control with NAF Reinforcement Learning.

Reinforcement Learning Enhanced Iterative Power Allocation in Stochastic Cognitive Wireless Mesh Networks

Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks.

Spectrum-Energy-Efficient Mode Selection and Resource Allocation for Heterogeneous V2X Networks: A Federated Multi-Agent Deep Reinforcement Learning Approach

Deep Reinforcement Learning Framework For Joint Resource Allocation In Heterogeneous Networks