Abstract:The collision avoidance mechanism adopted by the IEEE 802.11 standard is not optimal. The mechanism employs a binary exponential backoff (BEB) algorithm in the medium access control (MAC) layer. Such an algorithm increases the backoff interval whenever a collision is detected to minimize the probability of subsequent collisions. However, the increase of the backoff interval causes degradation of the radio spectrum utilization (i.e., bandwidth wastage). That problem worsens when the network has to manage the channel access to a dense number of stations, leading to a dramatic decrease in network performance. Furthermore, a wrong backoff setting increases the probability of collisions such that the stations experience numerous collisions before achieving the optimal backoff value. Therefore, to mitigate bandwidth wastage and, consequently, maximize the network performance, this work proposes using reinforcement learning (RL) algorithms, namely Deep Q Learning (DQN) and Deep Deterministic Policy Gradient (DDPG), to tackle such an optimization problem. In our proposed approach, we assess two different observation metrics, the average of the normalized level of the transmission queue of all associated stations and the probability of collisions. The overall network’s throughput is defined as the reward. The action is the contention window (CW) value that maximizes throughput while minimizing the number of collisions. As for the simulations, the NS-3 network simulator is used along with a toolkit known as NS3-gym, which integrates a reinforcement-learning (RL) framework into NS-3. The results demonstrate that DQN and DDPG have much better performance than BEB for both static and dynamic scenarios, regardless of the number of stations. Additionally, our results show that observations based on the average of the normalized level of the transmission queues have a slightly better performance than observations based on the collision probability. Moreover, the performance difference with BEB is amplified as the number of stations increases, with DQN and DDPG showing a 45.52% increase in throughput with 50 stations.

Online Policies for Throughput Maximization of Backscatter Assisted Wireless Powered Communication Via Reinforcement Learning Approaches.

Online Policies for Throughput Maximization of Energy-Constrained Wireless-Powered Communication Systems

Throughput Maximization for Ambient Backscatter Communication: A Reinforcement Learning Approach

Optimal Online Transmission Policy for Energy-Constrained Wireless-Powered Communication Networks.

Long-Term Throughput Maximization in Wireless Powered Communication Networks: A Multi-Task DRL Approach

Power Allocation for Full-Duplex Communication Systems Based on Deep Deterministic Policy Gradient

Deep Reinforcement Learning for Energy Efficiency Maximization in SWIPT-Based Over-the-Air Federated Learning

Reinforcement Learning Based Power Control for Reliable Mission-Critical Wireless Transmission

Joint Throughput Maximization and Energy Management for Ultra-low Power Ambient Backscatter Communication in WBANs by Distributed Deep Reinforcement Learning

Adaptive transmission scheduling over fading channels for energy-efficient cognitive radio networks by reinforcement learning

D2PG: deep deterministic policy gradient based for maximizing network throughput in clustered EH-WSN

Deep Reinforcement Learning Optimal Transmission Algorithm for Cognitive Internet of Things with RF Energy Harvesting

A Hybrid Communication Scheme for Throughput Maximization in Backscatter-Aided Energy Harvesting Cognitive Radio Networks

Online Power Control and Optimization for Energy Harvesting Communication System Based on State of Charge

An Online Adaptive Bandwidth Allocation Optimization Algorithm for Wireless Multimedia Communication Networks

Transmission with Energy Harvesting Nodes in Fading Wireless Channels: Optimal Policies

Deep Reinforcement Learning-based Power Control and Bandwidth Allocation Policy for Weighted Cost Minimization in Wireless Networks

Power Control for Wireless VBR Video Streaming: From Optimization to Reinforcement Learning

Deep Reinforcement Learning-Assisted Age-optimal Transmission Policy for HARQ-aided NOMA Networks

Reinforcement Learning-based Wi-Fi Contention Window Optimization

Deep Reinforcement Learning Empowered Rate Selection of XP-HARQ