A Simple Online Algorithm for Competing with Dynamic Comparators.

Yu-Jie Zhang,Peng Zhao,Zhi-Hua Zhou
2020-01-01
Abstract:Online learning in dynamic environments has recently drawn considerable attention, where dynamic regret is usually employed to compare decisions of online algorithms to dynamic comparators. In previous works, dynamic regret bounds are typically established in terms of regularity of comparators C-T or that of online functions V-T. Recently, Jadbabaie et al. [2015] propose an algorithm that can take advantage of both regularities and enjoy an (O) over tilde(root 1 + D-T + min{root(1 + D-T)C-T, (1+D-T)(VTT1/3)-V-1/3-T-1/3}) dynamic regret, where D-T is an additional quantity to measure the niceness of environments. The regret bound adapts to the smaller regularity of problem environments and is tighter than all existing dynamic regret guarantees. Nevertheless, their algorithm involves non-convex programming at each iteration, and thus requires burdensome computations. In this paper, we design a simple algorithm based on the online ensemble, which provably enjoys the same (even slightly stronger) guarantee as the state-of-the-art rate, yet is much more efficient because our algorithm does not involve any non-convex problem solving. Empirical studies also verify the efficacy and efficiency.
What problem does this paper attempt to address?