A Theory of Machine Learning

Jinsook Kim,Jinho Kang
2024-07-08
Abstract:We critically review three major theories of machine learning and provide a new theory according to which machines learn a function when the machines successfully compute it. We show that this theory challenges common assumptions in the statistical and the computational learning theories, for it implies that learning true probabilities is equivalent neither to obtaining a correct calculation of the true probabilities nor to obtaining an almost-sure convergence to them. We also briefly discuss some case studies from natural language processing and macroeconomics from the perspective of the new theory.
Machine Learning
What problem does this paper attempt to address?
This paper attempts to solve several core problems in machine learning theory and proposes a new machine learning theory. Specifically: 1. **Problems with Existing Theories**: - First, the paper critically reviews three major machine learning theories: the possible - worlds theory, the recognition theory, and the operational theory. - The possible - worlds theory and the recognition theory are both based on epistemological approaches, while the operational theory is based on behavioral approaches. - These theories all have important problems. For example, the possible - worlds theory cannot clearly explain what knowledge is, the recognition theory depends on circular definitions, and the operational theory cannot effectively distinguish different probability measures in practice. 2. **Proposal of the New Theory**: - The paper proposes a new machine learning theory, which holds that a machine learns a function if and only if the machine can successfully compute this function. - This new theory challenges the common assumptions in statistical learning and computational learning theories. In particular, it shows that learning the true probability is not equivalent to correctly computing the true probability or converging almost surely to the true probability. 3. **Implications of the New Theory**: - The new theory emphasizes the success and self - confidence of computation, that is, the machine not only needs to correctly compute the target function, but also needs to be confident that it is correct in most cases. - The paper also discusses two case studies in the fields of natural language processing and macroeconomics to show the applications and limitations of the new theory. 4. **Practical Significance**: - The paper explores how to theoretically draw practical conclusions in actual machine learning algorithms, especially in directly estimating the true probability. - Through case studies, the paper shows that in some cases, such as the N - gram model in natural language processing, the true probability can be learned by machine learning, while in other cases, such as macroeconomic models, the true probability may not be learned by machine learning. In conclusion, this paper aims to provide a more rigorous and practical machine learning theoretical framework to solve the problems existing in the existing theories and provide a new perspective for future machine learning research.