AI-Driven Resource Allocation in Optical Wireless Communication Systems

Abdelrahman S. Elgamal, Osama Z. Aletri, Barzan A. Yosuf, Ahmad Adnan Qidan, Taisir El-Gorashi, Jaafar M. H. Elmirghani
2023-04-08
Abstract: Visible light communication (VLC) is a promising solution to satisfy the extreme demands of emerging applications. VLC offers bandwidth that is orders of magnitude higher than what is offered by the radio spectrum, hence making best use of the resources is not a trivial matter. There is a growing interest to make next generation communication networks intelligent using AI based tools to automate the resource management and adapt to variations in the network automatically as opposed to conventional handcrafted schemes based on mathematical models assuming prior knowledge of the network. In this article, a reinforcement learning (RL) scheme is developed to intelligently allocate resources of an optical wireless communication (OWC) system in a HetNet environment. The main goal is to maximise the total reward of the system which is the sum rate of all users. The results of the RL scheme are compared with that of an optimization scheme that is based on Mixed Integer Linear Programming (MILP) model.
Signal Processing
What problem does this paper attempt to address?
This paper addresses the resource allocation problem in optical wireless communication (OWC) systems, specifically in a heterogeneous network (HetNet) environment. As demand for high-data-rate communication grows, visible light communication (VLC) has become a promising solution because it offers bandwidth orders of magnitude higher than the radio spectrum. However, using these resources effectively to serve multiple users and maximize overall system performance, measured here as the sum rate, is a non-trivial problem. The paper proposes a reinforcement learning (RL) scheme that allocates the resources of the OWC system intelligently. The scheme's goal is to maximize the total reward, defined as the sum rate of all users in the HetNet environment. Unlike conventional handcrafted schemes based on mathematical models that assume prior knowledge of the network, the RL scheme is intended to adapt automatically to variations in the network. To evaluate its performance, the paper compares the results of the RL scheme with those of an optimization scheme based on a mixed integer linear programming (MILP) model.
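To make the idea of learning an allocation that maximizes the sum rate concrete, here is a minimal Python sketch. It is not the paper's system model or algorithm: the rate table, the equal time-sharing rule at each access point, the tabular Q-learning setup, and every function name are assumptions made for illustration. An exhaustive search stands in for the MILP baseline, which is only feasible because the toy problem has eight possible allocations.

```python
import itertools
import random

# Hypothetical rates in Mbit/s: RATES[u][a] is what user u would get with
# access point a all to itself. The numbers are illustrative, not from the paper.
RATES = [[100.0, 30.0], [80.0, 40.0], [20.0, 35.0]]
N_USERS, N_APS = len(RATES), len(RATES[0])


def sum_rate(allocation):
    """Sum rate, assuming each access point shares time equally among its users."""
    load = [allocation.count(a) for a in range(N_APS)]
    return sum(RATES[u][a] / load[a] for u, a in enumerate(allocation))


def q_learning(episodes=5000, alpha=0.5, epsilon=0.5, seed=0):
    """Tabular Q-learning; users are assigned one at a time within an episode."""
    rng = random.Random(seed)
    q = {}  # (state, action) -> value; state is the tuple of APs chosen so far
    for _ in range(episodes):
        state = ()
        while len(state) < N_USERS:
            if rng.random() < epsilon:  # explore
                action = rng.randrange(N_APS)
            else:  # exploit the current estimates
                action = max(range(N_APS), key=lambda a: q.get((state, a), 0.0))
            nxt = state + (action,)
            if len(nxt) == N_USERS:
                # Terminal step: the reward is the sum rate of the full allocation.
                target = sum_rate(nxt)
            else:
                # Intermediate step: zero reward, bootstrap from the next state.
                target = max(q.get((nxt, a), 0.0) for a in range(N_APS))
            old = q.get((state, action), 0.0)
            q[(state, action)] = old + alpha * (target - old)
            state = nxt
    return q


def greedy_allocation(q):
    """Read out the allocation obtained by following the learned Q-values."""
    state = ()
    while len(state) < N_USERS:
        state += (max(range(N_APS), key=lambda a: q.get((state, a), 0.0)),)
    return state


def exhaustive_optimum():
    """Brute-force optimum, used here as a stand-in for an exact solver."""
    return max(itertools.product(range(N_APS), repeat=N_USERS), key=sum_rate)


if __name__ == "__main__":
    learned = greedy_allocation(q_learning())
    best = exhaustive_optimum()
    print("learned:", learned, sum_rate(learned))
    print("optimum:", best, sum_rate(best))
```

In this sketch the agent sees only the reward returned for a complete allocation and never reads the rate table directly, which mirrors the contrast the summary draws between a learning scheme and a model-based optimizer with prior knowledge of the network.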