Information Directed Learning Algorithm for Minimizing Queue Length Regret

Xinyu Hou,Haoyue Tang,Jintao Wang,Jian Song
DOI: https://doi.org/10.1109/bmsb55706.2022.9828660
2022-01-01
Abstract:In this paper, we consider a discrete time transmitter-receiver pair with $K$ error-prone transmission channels. In each slot, packets arrive at the transmitter randomly and wait in the queue before they are successfully delivered to the receiver. The goal is to design an adaptive channel selection strategy to minimize the queue length over $T$ consecutive slots in the absence of packet-loss probabilities. We categorize the current queueing status into two classes: (1) when the queue is empty, we fully explore all the channels through uniform sampling; (2) when there are untransmitted packets in the queue, we balance the explore-exploit trade-off using information directed sampling. We prove that the proposed algorithm reaches a time cumulative queue length regret of order $\mathcal{O}(1)$ . Simulation results validate the effectiveness of the proposed algorithm.
What problem does this paper attempt to address?