Abstract:Cellular network scheduling is crucial for wireless deployments like 4G, 5G, and 6G and is a challenging resource allocation task performed by the scheduler located at the base stations. The scheduler must balance two critical metrics, throughput and fairness, which often conflict, as maximizing throughput favors users with better channel conditions, while ensuring fairness requires allocating resources to those with poorer channel conditions. The proportional fairness metric is a prominent scheduling approach that aims to balance these competing metrics with minimal compromise. The common strategy to attain proportional fairness relies on a greedy approach in which each resource block is allocated to the user who maximizes the proportional fairness criterion. With such a strategy, the scheduler can ensure that the resources allocated to the users at each time instance maximize the proportional fairness metric. However, users can usually tolerate some delay and are willing to accept temporary fairness imbalances if they ultimately improve their performance, provided that the fairness criterion is maintained over time. In this paper, we propose a new scheduler that uses reinforcement learning to enhance proportional fairness. The suggested scheduler considers both current and predicted future channel conditions for each user, aiming to maximize the proportional fairness criterion over a set of predefined periodic time epochs. Specifically, by learning patterns in channel fluctuations, our reinforcement learning-based scheduler allocates each resource block not to the user who maximizes the instantaneous proportional fairness metric, but to the user who maximizes the expected proportional fairness metric at the end of the current time epoch. This approach achieves an improved balance between throughput and fairness across multiple slots. Simulations demonstrate that our approach outperforms standard proportional fairness scheduling. We further implemented the proposed scheme on a live 4G eNodeB station and observed similar gains.

Buffer-Aware Wireless Scheduling Based On Deep Reinforcement Learning

Deep-Reinforcement-Learning-Based Scheduling with Contiguous Resource Allocation for Next-Generation Cellular Systems

Deep Reinforcement Learning for Wireless Scheduling in Distributed Networked Control

Deep Reinforcement Learning Based Massive Access Management for Ultra-Reliable Low-Latency Communications

QoS Differentiated and Fair Packet Scheduling in Broadband Wireless Access Networks

A deep reinforcement learning-based D2D spectrum allocation underlaying a cellular network

Delay-Oriented Scheduling in 5G Downlink Wireless Networks Based on Reinforcement Learning With Partial Observations

Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches

Digital twin‐enabled deep reinforcement learning for joint scheduling of ultra‐reliable low latency communication and enhanced mobile broad band: A reliability‐guaranteed approach

Packet Scheduling In Broadband Wireless Networks Using Neuro-Dynamic Programming

Towards Practical Deep Schedulers for Allocating Cellular Radio Resources

Exploring Reinforcement Learning for Scheduling in Cellular Networks

Multi-agent Deep Reinforcement Learning for Cross-Layer Scheduling in Mobile Ad-Hoc Networks

Lyapunov-guided Multi-Agent Reinforcement Learning for Delay-Sensitive Wireless Scheduling

Load-Aware Distributed Resource Allocation for MF-TDMA Ad Hoc Networks: A Multi-Agent DRL Approach.

Deep Reinforcement Learning-Based Adaptive Scheduling for Wireless Time-Sensitive Networking

Exploiting Deep Reinforcement Learning for Edge Caching in Cell-Free Massive MIMO Systems

Traffic Priority-Aware Multi-User Distributed Dynamic Spectrum Access: A Multi-Agent Deep RL Approach

Deep Reinforcement Learning-Based Dynamic Spectrum Access for D2D Communication Underlay Cellular Networks

DRLS: A Deep Reinforcement Learning Based Scheduler for Time-Triggered Ethernet

A2C-DRL: Dynamic Scheduling for Stochastic Edge-Cloud Environments Using A2C and Deep Reinforcement Learning