LEARN: An Invex Loss for Outlier Oblivious Robust Online Optimization

Adarsh Barik,Anand Krishna,Vincent Y. F. Tan
2024-08-13
Abstract:We study a robust online convex optimization framework, where an adversary can introduce outliers by corrupting loss functions in an arbitrary number of rounds k, unknown to the learner. Our focus is on a novel setting allowing unbounded domains and large gradients for the losses without relying on a Lipschitz assumption. We introduce the Log Exponential Adjusted Robust and iNvex (LEARN) loss, a non-convex (invex) robust loss function to mitigate the effects of outliers and develop a robust variant of the online gradient descent algorithm by leveraging the LEARN loss. We establish tight regret guarantees (up to constants), in a dynamic setting, with respect to the uncorrupted rounds and conduct experiments to validate our theory. Furthermore, we present a unified analysis framework for developing online optimization algorithms for non-convex (invex) losses, utilizing it to provide regret bounds with respect to the LEARN loss, which may be of independent interest.
Machine Learning,Optimization and Control
What problem does this paper attempt to address?
This paper attempts to address the robustness issue in the Online Convex Optimization (OCO) framework, particularly how to achieve sublinear clean dynamic regret in the presence of an unknown number of outliers. Specifically, the paper investigates how to design a robust optimization algorithm that can maintain good performance even without prior knowledge of the number of outliers, when the adversary can arbitrarily tamper with the loss functions of certain rounds. The main contributions of the paper include: 1. **Introduction of Robust Non-Convex Loss Function**: A loss function named LEARN (Log Exponential Adjusted Robust and iNvex) is proposed to handle the impact of outliers and allows for rigorous theoretical analysis. 2. **Unified Analytical Framework**: A robust online gradient descent algorithm based on the LEARN loss function is developed, and a unified analytical framework is provided to prove its regret upper bounds in different scenarios. 3. **Sublinear Clean Dynamic Regret**: In a bounded domain, the proposed algorithm achieves sublinear growth of clean dynamic regret with respect to the time horizon T and the number of outliers k. 4. **Extension to Unbounded Domain**: Using the expert framework method, sublinear clean dynamic regret is also achieved in unbounded domains, providing the first dynamic regret bounds for unbounded domains with outliers. Through theoretical analysis and experimental validation, the paper demonstrates the effectiveness and robustness of the LEARN algorithm in the presence of outliers, especially in handling online ridge regression and online support vector machine tasks.