Abstract:In this work, we aim to characterize the statistical complexity of realizable regression both in the PAC learning setting and the online learning setting. Previous work had established the sufficiency of finiteness of the fat shattering dimension for PAC learnability and the necessity of finiteness of the scaled Natarajan dimension, but little progress had been made towards a more complete characterization since the work of Simon (SICOMP '97). To this end, we first introduce a minimax instance optimal learner for realizable regression and propose a novel dimension that both qualitatively and quantitatively characterizes which classes of real-valued predictors are learnable. We then identify a combinatorial dimension related to the Graph dimension that characterizes ERM learnability in the realizable setting. Finally, we establish a necessary condition for learnability based on a combinatorial dimension related to the DS dimension, and conjecture that it may also be sufficient in this context. Additionally, in the context of online learning we provide a dimension that characterizes the minimax instance optimal cumulative loss up to a constant factor and design an optimal online learner for realizable regression, thus resolving an open question raised by Daskalakis and Golowich in STOC '22.

What problem does this paper attempt to address?

The core problem that this paper attempts to solve is: **How to represent and design the statistical complexity and optimal learner for realizable real - valued regression, especially in the context of PAC learning and online learning.** Specifically, the authors focus on the following two main issues: 1. **Realizable real - valued regression in PAC learning**: - The authors hope to find the dimension that can represent the learnability of realizable real - valued regression in the PAC learning framework and design the optimal learning algorithm. - Previous research has proven that the finite "fat shattering dimension" is a sufficient condition for PAC learnability, and the finite "scaled Natarajan dimension" is a necessary condition, but there has not been a more complete representation yet. 2. **Realizable real - valued regression in online learning**: - The authors aim to provide a dimension that can represent the minimax instance - optimal cumulative loss and design an optimal online learning algorithm. - This work solves an open problem proposed by Daskalakis and Golowich in STOC '22. ### Main contributions - **Introduced a new minimax instance - optimal learner**: The authors proposed a new minimax instance - optimal learner for realizable real - valued regression and introduced a new dimension to qualitatively and quantitatively represent which real - valued predictors are learnable. - **Identified the combinatorial dimension related to the graph dimension**: This dimension can represent the ERM (empirical risk minimization) learnability in the realizable real - valued regression setting. - **Established the necessary condition for learnability based on the combinatorial dimension related to the DS dimension**: And conjectured that this condition is also sufficient in this context. - **Progress in online learning**: The authors provided a dimension that can represent the minimax instance - optimal cumulative loss and designed an optimal online learning algorithm, thus solving the open problem proposed by Daskalakis and Golowich. ### Related formulas Some of the key definitions and formulas involved in the paper are as follows: - **Absolute loss function**: \[ \ell(x, y)=|x - y|\quad\text{for any}\;x, y\in[0, 1] \] - **Fat Shattering Dimension**: \[ D_{\text{fat}}^\gamma(\mathcal{H})=\max\left\{n:\exists S\subseteq\mathcal{X}, |S| = n, S\text{ is }\gamma\text{ - fat shattered by }\mathcal{H}\right\} \] - **Natarajan Dimension**: \[ D_{\text{Nat}}^\gamma(\mathcal{H})=\max\left\{n:\exists S\subseteq\mathcal{X}, |S| = n, S\text{ is }\gamma\text{ - Natarajan shattered by }\mathcal{H}\right\} \] - **Graph Dimension**: \[ D_G^\gamma(\mathcal{H})=\max\left\{n:\exists S\subseteq\mathcal{X}, |S| = n, S\text{ is }\gamma\text{ - graph shattered by }\mathcal{H}\right\} \] Through the introduction and analysis of these dimensions, the authors hope to fill in the existing theory.

Optimal Learners for Realizable Regression: PAC Learning and Online Learning

Realizable Learning is All You Need

Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

Is Efficient PAC Learning Possible with an Oracle That Responds 'Yes' or 'No'?

Regularization and Optimal Multiclass Learning

Adversarially Robust PAC Learnability of Real-Valued Functions

Optimality of Robust Online Learning

Online Learning with Set-Valued Feedback

On the Computational Landscape of Replicable Learning

Probably Approximately Precision and Recall Learning

LEARN: An Invex Loss for Outlier Oblivious Robust Online Optimization

Majority-of-Three: The Simplest Optimal Learner?

No-Regret Learnability for Piecewise Linear Losses

A Characterization of List Regression

On the Computability of Robust PAC Learning

Towards Instance-Optimality in Online PAC Reinforcement Learning

A Combinatorial Characterization of Supervised Online Learnability

Sequential Probability Assignment with Contexts: Minimax Regret, Contextual Shtarkov Sums, and Contextual Normalized Maximum Likelihood

Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage

Overcoming Brittleness in Pareto-Optimal Learning-Augmented Algorithms

When is Agnostic Reinforcement Learning Statistically Tractable?