Deep Reinforcement Learning-based Dynamic Bandwidth Allocation in Weighted Fair Queues of Routers.

Jinyan Pan, Gang Chen, Haoran Wu, Xi Peng, Li Xia
DOI: https://doi.org/10.1109/case49997.2022.9926628
2022-01-01
Abstract: Motivated by a real problem concerning the service mechanism at the output port of routers, this paper studies dynamic bandwidth allocation in a G/G/1/K parallel queueing system, where the weighted fair queueing (WFQ) scheduling discipline is applied to support differentiated services for different packet queues. The bursty and complicated characteristics of Internet traffic pose a challenge to analytic solutions for dynamic bandwidth allocation, which require distributional information about traffic patterns. Since the distributional information of Internet traffic is typically unavailable and varies over time, we propose a deep reinforcement learning (DRL) framework that trains a bandwidth controller by adaptively interacting with the environment. The controller dynamically allocates bandwidth weights among multiple queues according to the observed instantaneous queue lengths. We train the controller with two advanced DRL algorithms, DDPG and SAC. With real traffic data, experimental results show that our trained controllers achieve a lower average delay and packet loss rate than a rule-based policy. Our proposed WFQ-DRL algorithm is a first attempt to apply RL algorithms to real router scenarios, in which the system has eight or more queues and the traffic is diverse, real traffic that need not satisfy a Poisson assumption.
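To make the control loop in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes an idealized fluid model of WFQ in which link capacity is split among backlogged queues in proportion to their weights, and it uses a softmax over queue lengths as a hypothetical stand-in for the trained DDPG or SAC policy. All function names, the `temperature` parameter, and the arrival-then-service ordering are assumptions made for illustration.

```python
import math


def softmax_weights(queue_lengths, temperature=1.0):
    """Hypothetical stand-in for the trained controller.

    Maps observed queue lengths to bandwidth weights that sum to 1,
    giving longer queues larger weights.
    """
    m = max(queue_lengths)  # subtract the max for numerical stability
    exps = [math.exp((q - m) / temperature) for q in queue_lengths]
    total = sum(exps)
    return [e / total for e in exps]


def wfq_rates(weights, queue_lengths, capacity):
    """Idealized fluid WFQ: share capacity among backlogged queues
    in proportion to their weights; empty queues receive nothing."""
    active = [w if q > 0 else 0.0 for w, q in zip(weights, queue_lengths)]
    total = sum(active)
    if total == 0:
        return [0.0] * len(weights)
    return [capacity * w / total for w in active]


def step(queue_lengths, arrivals, capacity, buffer_size):
    """One time slot (assumed ordering: arrivals, then service).

    Returns the next queue lengths and the packets dropped per queue
    because the finite buffer (the K in G/G/1/K) was full.
    """
    admitted = [min(buffer_size, q + a) for q, a in zip(queue_lengths, arrivals)]
    dropped = [q + a - adm for q, a, adm in zip(queue_lengths, arrivals, admitted)]
    weights = softmax_weights(admitted)
    rates = wfq_rates(weights, admitted, capacity)
    next_q = [max(0.0, adm - r) for adm, r in zip(admitted, rates)]
    return next_q, dropped


if __name__ == "__main__":
    next_q, dropped = step([8, 0, 2], [5, 0, 1], capacity=4.0, buffer_size=10)
    print("next queue lengths:", next_q)
    print("dropped packets:", dropped)
```

In a DRL setting, `softmax_weights` would be replaced by the learned policy, with the queue lengths as the state, the weights as the action, and a reward built from delay and packet loss.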