Understand Dynamic Regret with Switching Cost for Online Decision Making

Yawei Zhao,Qian Zhao,Xingxing Zhang,En Zhu,Xinwang Liu,Jianping Yin
DOI: https://doi.org/10.1145/3375788
IF: 5
2020-01-01
ACM Transactions on Intelligent Systems and Technology
Abstract:As a metric to measure the performance of an online method, dynamic regret with switching cost has drawn much attention for online decision making problems. Although the sublinear regret has been provided in much previous research, we still have little knowledge about the relation between the dynamic regret and the switching cost. In the article, we investigate the relation for two classic online settings: Online Algorithms (OA) and Online Convex Optimization (OCO). We provide a new theoretical analysis framework that shows an interesting observation; that is, the relation between the switching cost and the dynamic regret is different for settings of OA and OCO. Specifically, the switching cost has significant impact on the dynamic regret in the setting of OA. But it does not have an impact on the dynamic regret in the setting of OCO. Furthermore, we provide a lower bound of regret for the setting of OCO, which is same with the lower bound in the case of no switching cost. It shows that the switching cost does not change the difficulty of online decision making problems in the setting of OCO.
What problem does this paper attempt to address?