Abstract:We critically review three major theories of machine learning and provide a new theory according to which machines learn a function when the machines successfully compute it. We show that this theory challenges common assumptions in the statistical and the computational learning theories, for it implies that learning true probabilities is equivalent neither to obtaining a correct calculation of the true probabilities nor to obtaining an almost-sure convergence to them. We also briefly discuss some case studies from natural language processing and macroeconomics from the perspective of the new theory.

What problem does this paper attempt to address?

This paper attempts to solve several core problems in machine learning theory and proposes a new machine learning theory. Specifically: 1. **Problems with Existing Theories**: - First, the paper critically reviews three major machine learning theories: the possible - worlds theory, the recognition theory, and the operational theory. - The possible - worlds theory and the recognition theory are both based on epistemological approaches, while the operational theory is based on behavioral approaches. - These theories all have important problems. For example, the possible - worlds theory cannot clearly explain what knowledge is, the recognition theory depends on circular definitions, and the operational theory cannot effectively distinguish different probability measures in practice. 2. **Proposal of the New Theory**: - The paper proposes a new machine learning theory, which holds that a machine learns a function if and only if the machine can successfully compute this function. - This new theory challenges the common assumptions in statistical learning and computational learning theories. In particular, it shows that learning the true probability is not equivalent to correctly computing the true probability or converging almost surely to the true probability. 3. **Implications of the New Theory**: - The new theory emphasizes the success and self - confidence of computation, that is, the machine not only needs to correctly compute the target function, but also needs to be confident that it is correct in most cases. - The paper also discusses two case studies in the fields of natural language processing and macroeconomics to show the applications and limitations of the new theory. 4. **Practical Significance**: - The paper explores how to theoretically draw practical conclusions in actual machine learning algorithms, especially in directly estimating the true probability. - Through case studies, the paper shows that in some cases, such as the N - gram model in natural language processing, the true probability can be learned by machine learning, while in other cases, such as macroeconomic models, the true probability may not be learned by machine learning. In conclusion, this paper aims to provide a more rigorous and practical machine learning theoretical framework to solve the problems existing in the existing theories and provide a new perspective for future machine learning research.

A Theory of Machine Learning

A General Theory for Training Learning Machine

An Intelligent Model with Chaos and Causality

Can Machines Learn the True Probabilities?

Information-Theoretic Foundations for Machine Learning

The Challenges of Machine Learning: A Critical Review

Beneficial and Harmful Explanatory Machine Learning

Machine Learning from Theory to Algorithms: An Overview

Information Theory and its Relation to Machine Learning

A Probabilistic Theory of Deep Learning

Machine Learning and Computational Mathematics

The Computational Principles of Learning Ability

Machine learning for decision-making under uncertainty

Generalization in Machine Learning via Analytical Learning Theory

Machine Learning and Theory Ladenness -- A Phenomenological Account

Competitive Machine Learning: Best Theoretical Prediction vs Optimization

Toward a `Standard Model' of Machine Learning

Reconciling modern machine-learning practice and the classical bias–variance trade-off

Machine Learning: An Applied Econometric Approach

Learning principle and mathematical realization of the learning mechanism in the brain

A category theory approach to the semiotics of machine learning