Dynamic Regret with Unknown Delays
Peixuan Wu,Zhengyang Liu,Hui Lü,Heyan Huang
DOI: https://doi.org/10.2139/ssrn.4679653
2023-01-01
Abstract:In this paper, we delve into the realm of delayed online convex optimization, addressing scenarios where unknown delays exist between the querying and revealing of feedback. Obtaining dynamic regret in this case poses challenges arising from the uncertainties associated with both comparator sequence and delay sequence. First, to handle the uncertainty of comparator sequence, we introduce an adaptive delayed algorithm, yielding an $\mathcal{O}(\max{\sqrt{(1+\Delta_T)D_T},(\hat{d}-1)\Delta_T})$ dynamic regret bound for arbitrary comparator sequences. Here, $\Delta_T$ represents the path-length of the comparator sequence, $\hat{d}$ denotes the maximum delay, and $D_T$ signifies the cumulative delay over the entire horizon. Our approach centers around employing a delayed meta algorithm to track parallel delayed expert algorithms with varying step sizes. To complement our theoretical analysis, we establish an $\Omega(\sqrt{(1+\Delta_T)D_T})$ lower bound. Furthermore, to address the uncertainty of delay sequence, we extend our adaptive algorithm to scenarios where the time horizon and cumulative delay are unknown, incorporating a delayed doubling trick tailored to the number of missing feedback instances, thereby achieving a comparable dynamic regret. Finally, numerical simulations are presented to validate the effectiveness of our proposed approach.