Abstract:Device-to-device (D2D) communication has been regarded as a promising solution to alleviate the mobile traffic explosion problem for its capabilities of improving system data rate and resource utilization. A reconfigurable intelligent surface (RIS) aided mobile D2D communications framework is investigated, where the RIS is deployed to improve communication quality. As the transmission distance of D2D pairs changes, the mode selection for D2D pairs and the phase shift design for RIS is essential for mobile scenarios. Therefore, we formulate a joint optimization problem of mode selection, channel assignment, power allocation, and discrete phase shift selection to maximize the average sum data rate of D2D pairs. This problem is also constrained by the maximum transmit power and the minimum data rate requirements of users, where the latter is to guarantee the fairness of D2D pairs. We first reformulate the original sequential decision-making problem into a Markov game (MG) problem to solve the challenging optimization. Furthermore, a multi-agent deep reinforcement learning (MADRL) framework is proposed, in which multiple agents cooperatively determine the joint mode selection and resource allocation strategy. The proposed MADRL-based framework combines both the multi-pass deep Q-networks (MP-DQN) algorithm and the decaying DQN algorithm to solve the optimization problem. Specifically, we adopt the MP-DQN algorithm for D2D pairs to handle the hybrid discrete-continuous action space. Moreover, the decaying DQN algorithm is invoked by the RIS agent to select discrete phase shifts. Simulation results demonstrate that the proposed algorithm can converge under different cases. The proposed MADRL-based algorithm outperforms the combination algorithm of DQN and the deep deterministic policy gradient (DDPG) in terms of system performance. Moreover, it is also shown that the average sum data rate of D2D pairs can be significantly improved by deploying the RIS and further enhanced by increasing the number of reflecting elements (REs).

Joint Mode Selection and Power Adaptation for D2D Communication with Reinforcement Learning.

Joint Mode Selection and Resource Allocation for Device-to-Device Communications

Joint Mode Selection, Link Allocation and Power Control in Underlaying D2D Communication.

Joint Mode Selection and Resource Allocation in Underlaying D2D Communication.

Joint mode selection, MCS assignment, resource allocation and power control for D2D communication underlaying cellular networks

Energy-efficient Mode Selection and Power Control for Device-to-device Communications.

Joint Deep Reinforcement Learning and Unsupervised Learning for Channel Selection and Power Control in D2D Networks

Mode Switching for Energy-Efficient Device-to-Device Communications in Cellular Networks

Joint Power Allocation and Reuse Partner Selection for Device-To-Device Communications

Deep Reinforcement Learning for Joint Channel Selection and Power Control in D2D Networks

Social-aware Energy-Efficient Joint Mode Selection and Link Allocation in D2D Communications

Deep Multi-Agent Reinforcement Learning for Resource Allocation in D2D Communication Underlaying Cellular Networks

Research on Joint Mode Selection and Resource Allocation Scheme in D2D Networks

Double Deep Q-Network Based Distributed Resource Matching Algorithm for D2D Communication

Deep Reinforcement Learning Empowered Joint Mode Selection and Resource Allocation for RIS-aided D2D Communications

Resource Allocation and Power Control Policy for Device-to-Device Communication Using Multi-Agent Reinforcement Learning

Social-aware Joint Mode Selection and Link Allocation for Device-to-device Communication Underlaying Cellular Networks

A Multi-agent Reinforcement Learning Based Power Control Algorithm for D2D Communication Underlaying Cellular Networks

A Joint Relay Selection and Power Allocation Strategy for Multi-Hop D2D Communications

Energy-Efficient Mode Selection and Resource Allocation for D2D-Enabled Heterogeneous Networks: A Deep Reinforcement Learning Approach

QoS-aware joint mode selection and channel assignment for D2D communications