Open Problem: Is There an Online Learning Algorithm That Learns Whenever Online Learning Is Possible?

Steve Hanneke
DOI: https://doi.org/10.48550/arXiv.2107.09542
2021-07-20
Abstract: This open problem asks whether there exists an online learning algorithm for binary classification that guarantees, for all target concepts, to make a sublinear number of mistakes, under only the assumption that the (possibly random) sequence of points X allows that such a learning algorithm can exist for that sequence. As a secondary problem, it also asks whether a specific concise condition completely determines whether a given (possibly random) sequence of points X admits the existence of online learning algorithms guaranteeing a sublinear number of mistakes for all target concepts.
Machine Learning, Artificial Intelligence, Probability, Statistics Theory
What problem does this paper attempt to address?
The paper poses two open problems:

1. **Does there exist an optimistically generally applicable online learning algorithm?**
   - The first open problem asks whether there exists an "optimistically generally applicable online learning algorithm": an algorithm whose number of prediction mistakes grows sublinearly, i.e. is \(o(T)\) after \(T\) rounds, for every target concept and for every random sequence \(X\) under which general online learning is possible at all. The weak form and the strong form of the problem require this \(o(T)\) guarantee to hold in expectation and almost surely (a.s.), respectively.
   - The author offers a reward for a solution to this problem, whether positive or negative.
2. **When is general online learning possible?**
   - The second open problem asks which random sequences \(X\) make general online learning possible. The paper defines two sets, \(C_w\) and \(C_s\), as candidate conditions for weak and strong general online learning, respectively. Here \(X_1^T\) denotes the first \(T\) points of \(X\):
     - \(C_w\) contains all random sequences \(X\) that satisfy \(E[|\{i\in\mathbb{N}:X_1^T\cap A_i\neq\emptyset\}|]=o(T)\) for every disjoint sequence of measurable sets \(\{A_i\}_{i\in\mathbb{N}}\).
     - \(C_s\) contains all random sequences \(X\) that satisfy \(|\{i\in\mathbb{N}:X_1^T\cap A_i\neq\emptyset\}| = o(T)\) almost surely for every disjoint sequence of measurable sets \(\{A_i\}_{i\in\mathbb{N}}\).
   - The author's questions are: Is \(C_w\) equal to the set of all random sequences \(X\) that make weak general online learning possible? Is \(C_s\) equal to the set of all random sequences \(X\) that make strong general online learning possible?
   - The author also offers a reward for resolving these questions, that is, for determining whether these conditions are both necessary and sufficient.

Solving these two problems would deepen the understanding of online learning theory and may also suggest new ideas for designing more effective online learning algorithms.
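
To make the quantity in the definitions of \(C_w\) and \(C_s\) concrete, the following is a minimal sketch, not taken from the paper, that computes \(|\{i : X_1^T\cap A_i\neq\emptyset\}|\) for every prefix of a finite sequence. The names `visited_set_counts` and `cell_of` are hypothetical, and the singleton family \(A_i=\{i\}\) on the natural numbers is an assumption chosen only for illustration. A finite simulation cannot establish an asymptotic \(o(T)\) property, and the actual conditions quantify over every disjoint family, so this only shows what is being counted.

```python
import math
from typing import Callable, Hashable, Iterable, List, Optional


def visited_set_counts(
    xs: Iterable[int],
    cell_of: Callable[[int], Optional[Hashable]],
) -> List[int]:
    """Return, for each prefix length T, the number of sets A_i hit by X_1..X_T.

    `cell_of(x)` gives the index i of the (unique) set A_i containing x,
    or None if x lies in none of the sets. Disjointness of the A_i is what
    makes this index well defined.
    """
    seen = set()
    counts = []
    for x in xs:
        i = cell_of(x)
        if i is not None:
            seen.add(i)
        counts.append(len(seen))
    return counts


# Illustrative assumption: A_i = {i}, so the cell of a point is the point itself.
singleton_cell = lambda x: x

T = 10_000

# Sequence that never repeats a point: X_t = t.
# Every round hits a new set, so the count equals T (linear growth).
fresh = visited_set_counts(range(1, T + 1), singleton_cell)

# Sequence that revisits points often: X_t = floor(sqrt(t)).
# Only floor(sqrt(T)) distinct points appear among the first T rounds.
repeats = visited_set_counts(
    (math.isqrt(t) for t in range(1, T + 1)), singleton_cell
)

print(fresh[-1] / T)    # 1.0
print(repeats[-1] / T)  # 0.01
```

In this sketch the first sequence hits a new set in every round, so the count grows linearly and the condition fails for the singleton family. For the second sequence, each distinct point lies in at most one \(A_i\), so for any disjoint family the count is at most the number of distinct points seen, which is \(\lfloor\sqrt{T}\rfloor\) here.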