Abstract: Hybrid radio frequency (RF) and visible light communication (VLC) networks can provide high throughput and energy efficiency through VLC access points (APs) while ensuring ubiquitous coverage through RF APs. Because channel conditions are dynamic and resources are limited, resource allocation in hybrid RF/VLC networks is complex and challenging, and conventional resource allocation techniques fail to overcome these challenges. Heuristic methods can solve high-complexity problems; however, they are not robust to changes such as dynamic channel conditions or changing user requirements. They also require centralized control for stability, which adds communication overhead between APs. Deep reinforcement learning (DRL) based solutions can handle high complexity, dynamic channel conditions, and changing user requirements without requiring centralized control. In this paper, we formulate a distributed downlink power allocation problem that optimizes the transmit power so that users reach their target data rates in hybrid RF/VLC networks. We then propose a distributed DRL-based algorithm, Deep Deterministic Policy Gradient (DDPG), to solve the formulated computationally intensive problem. We implement a simulation environment to benchmark the proposed distributed DRL-based method against other methods such as Q-Learning (QL) and Deep Q-Networks (DQN), and against centralized heuristic power allocation algorithms. Our simulation results show that the distributed DDPG-based algorithm learns to adapt to changes in the channel or user requirements, while centralized Genetic Algorithm and Particle Swarm Optimization-based algorithms fail to cope with these changes even with coordination between APs. Additionally, we quantify how the DDPG-based algorithm outperforms the other DRL-based algorithms at the expense of higher implementation complexity.

Power Allocation in Ultra-Dense Networks Through Deep Deterministic Policy Gradient

Power Allocation for Full-Duplex Communication Systems Based on Deep Deterministic Policy Gradient

Delay-Aware Power Control for Downlink Multi-User MIMO Via Constrained Deep Reinforcement Learning

Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches

Low-Coupling Policy Optimization Framework for Power Allocation in Ultra-Dense Small-Cell Networks

Accelerating Deep Reinforcement Learning With the Aid of Partial Model: Energy-Efficient Predictive Video Streaming

Power Allocation Based on Multi-Agent Deep Deterministic Policy Gradient for Underwater Acoustic Communication Networks

DDPG with Transfer Learning and Meta Learning Framework for Resource Allocation in Underlay Cognitive Radio Network

Joint EH Time and Transmit Power Optimization Based on DDPG for EH Communications

Joint Interference Alignment and Power Control for Dense Networks Via Deep Reinforcement Learning

Deep Deterministic Policy Gradient for Relay Selection and Power Allocation in Cooperative Communication Network

User Association and Power Allocation for User-Centric Smart-Duplex Networks via Tree-Structured Deep Reinforcement Learning

A Deep Reinforcement Learning-Based Technique for Optimal Power Allocation in Multiple Access Communications

Downlink Power Control for Cell-Free Massive MIMO with Deep Reinforcement Learning

Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms

Distributed DRL-based Downlink Power Allocation for Hybrid RF/VLC Networks

Deep Reinforcement Learning-based Power Control and Bandwidth Allocation Policy for Weighted Cost Minimization in Wireless Networks

Learning Deterministic Policy with Target for Power Control in Wireless Networks

Deep Reinforcement Learning-Assisted Energy Harvesting Wireless Networks

5G Multi-Slices Bi-Level Resource Allocation by Reinforcement Learning

A power allocation scheme using non-cooperative game theory in ultra-dense networks