Abstract:We develop algorithms for online linear regression which achieve optimal static and dynamic regret guarantees \emph{even in the complete absence of prior knowledge}. We present a novel analysis showing that a discounted variant of the Vovk-Azoury-Warmuth forecaster achieves dynamic regret of the form $R_{T}(\vec{u})\le O\left(d\log(T)\vee \sqrt{dP_{T}^{\gamma}(\vec{u})T}\right)$, where $P_{T}^{\gamma}(\vec{u})$ is a measure of variability of the comparator sequence, and show that the discount factor achieving this result can be learned on-the-fly. We show that this result is optimal by providing a matching lower bound. We also extend our results to \emph{strongly-adaptive} guarantees which hold over every sub-interval $[a,b]\subseteq[1,T]$ simultaneously.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to achieve optimal static and dynamic regret guarantees in the absence of any prior knowledge when performing online linear regression in a dynamic environment. Specifically, the authors propose new algorithms and techniques, enabling the online linear regression model to maintain good prediction performance even when the data distribution changes over time. ### Main Problem Description 1. **Online Linear Regression in a Dynamic Environment**: - Online linear regression is a classic least - squares regression problem, but is applicable to streaming data. In each round, the learner needs to predict the target value $\hat{y}_t$ based on the feature vector $x_t$ before observing the target signal $y_t$. - A dynamic environment means that the data - generating distribution may change over time, which makes the traditional static regret analysis no longer applicable. 2. **Requirement of No Prior Knowledge**: - Many existing online linear regression algorithms rely on some assumptions or prior knowledge about the data, such as data range, distribution, etc. However, in practical applications, this prior knowledge is often unavailable. - The goal of this paper is to develop an algorithm that does not require any prior knowledge while still providing strong performance guarantees. ### Solutions 1. **Discounted Vovk - Azoury - Warmuth (VAW) Predictor**: - A discounted version of the VAW predictor is proposed. By introducing the discount factor $\gamma$, the algorithm can focus more on the most recent data, thus better adapting to changes in the data distribution. - The formula is expressed as: \[ w_t=\left(\gamma^t\lambda I+\sum_{s = 1}^{t}\gamma^{t - s}x_sx_s^{\top}\right)^{-1}\left(\tilde{y}_t x_t+\gamma^{t - 1}\theta_t\right) \] 2. **Dynamic Regret Guarantee**: - It is proved that the discounted VAW predictor can achieve dynamic regret $R_T(u)\leq O\left(\sqrt{dP_\gamma^T(u)T}\right)$, where $P_\gamma^T(u)$ is the variability measure of the comparison sequence $u$. - A matching lower bound $R_T(u)\geq\Omega\left(\sqrt{dP_\gamma^T(u)T}\right)$ is further provided, proving the optimality of this result. 3. **Adaptability and Small - Loss Guarantee**: - By introducing the prediction hint $\tilde{y}_t$, the algorithm can automatically perform better on "easy" data, that is, when the loss of the comparison sequence $u$ is low, the dynamic regret will also decrease accordingly. - The specific formula is: \[ R_T(u)\leq\tilde{O}\left(P_\gamma^T(u)+\sqrt{P_\gamma^T(u)T\sum_{t = 1}^T\ell_t(u_t)}\right) \] 4. **Learning of the Discount Factor**: - A method is proposed so that the discount factor $\gamma$ can be automatically learned during the running process, thus not requiring the optimal $\gamma$ value to be known in advance. ### Summary This paper solves the key problem in online linear regression in a dynamic environment, that is, how to achieve optimal dynamic regret guarantees without prior knowledge, by introducing the discount factor and the improved VAW predictor. This provides a powerful method for processing non - stationary data streams and has broad application prospects in practical applications.

Online Linear Regression in Dynamic Environments via Discounting

Adaptive Online Learning in Dynamic Environments.

An Equivalence Between Static and Dynamic Regret Minimization

Projection-free Online Learning in Dynamic Environments

Dynamic Regret of Convex and Smooth Functions

Minimizing Dynamic Regret and Adaptive Regret Simultaneously

Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization

Efficient Methods for Non-stationary Online Learning

On Online Optimization: Dynamic Regret Analysis of Strongly Convex and Smooth Problems

Electronic structure study by means of X-ray spectroscopy and theoretical calculations of the "ferric star" single molecule magnet

A Simple Online Algorithm for Competing with Dynamic Comparators.

Online Optimization With Predictions and Switching Costs: Fast Algorithms and the Fundamental Limit

Tracking Slowly Moving Clairvoyant: Optimal Dynamic Regret of Online Learning with True and Noisy Gradient

Minimizing Adaptive Regret with One Gradient Per Iteration

On the Computational Efficiency of Adaptive and Dynamic Regret Minimization

Discounted Adaptive Online Learning: Towards Better Regularization

Online Convex Optimization with Memory and Limited Predictions

Risk-averse Learning with Non-Stationary Distributions

Nonstationary Nonparametric Online Learning: Balancing Dynamic Regret and Model Parsimony

Non-stationary Projection-free Online Learning with Dynamic and Adaptive Regret Guarantees

Distributionally Time-Varying Online Stochastic Optimization under Polyak-Łojasiewicz Condition with Application in Conditional Value-at-Risk Statistical Learning