The Challenges of Machine Learning: A Critical Review

Enrico Barbierato,Alice Gatti
DOI: https://doi.org/10.3390/electronics13020416
IF: 2.9
2024-01-20
Electronics
Abstract:The concept of learning has multiple interpretations, ranging from acquiring knowledge or skills to constructing meaning and social development. Machine Learning (ML) is considered a branch of Artificial Intelligence (AI) and develops algorithms that can learn from data and generalize their judgment to new observations by exploiting primarily statistical methods. The new millennium has seen the proliferation of Artificial Neural Networks (ANNs), a formalism able to reach extraordinary achievements in complex problems such as computer vision and natural language recognition. In particular, designers claim that this formalism has a strong resemblance to the way the biological neurons operate. This work argues that although ML has a mathematical/statistical foundation, it cannot be strictly regarded as a science, at least from a methodological perspective. The main reason is that ML algorithms have notable prediction power although they cannot necessarily provide a causal explanation about the achieved predictions. For example, an ANN could be trained on a large dataset of consumer financial information to predict creditworthiness. The model takes into account various factors like income, credit history, debt, spending patterns, and more. It then outputs a credit score or a decision on credit approval. However, the complex and multi-layered nature of the neural network makes it almost impossible to understand which specific factors or combinations of factors the model is using to arrive at its decision. This lack of transparency can be problematic, especially if the model denies credit and the applicant wants to know the specific reasons for the denial. The model's "black box" nature means it cannot provide a clear explanation or breakdown of how it weighed the various factors in its decision-making process. Secondly, this work rejects the belief that a machine can simply learn from data, either in supervised or unsupervised mode, just by applying statistical methods. The process of learning is much more complex, as it requires the full comprehension of a learned ability or skill. In this sense, further ML advancements, such as reinforcement learning and imitation learning denote encouraging similarities to similar cognitive skills used in human learning.
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?
The paper attempts to address several key issues in the field of Machine Learning (ML): 1. **Effectiveness of ML as a Science**: The author argues that although ML has strong predictive capabilities, it lacks causal explanatory power, which prevents it from being strictly considered a science at the methodological level. For example, in credit scoring, neural networks can provide scoring results but cannot explain which specific factors influenced the score. 2. **Exaggeration of the Concept of "Machine Learning"**: The paper points out that the term "machine learning" exaggerates its actual functionality because machines do not truly understand what they learn; they are merely trained to complete specific tasks, whereas human learning involves an understanding of skills. 3. **Questioning the Existing Technological Foundation**: The author cautions readers to be skeptical about the innovativeness of ML technologies, as many techniques actually originate from statistics and are not new inventions. 4. **Issues of Data Quality and Validity**: Even after model training is completed, its performance cannot be guaranteed to be effective because the training data may be imbalanced, biased, or insufficient. 5. **Exploration of Human-like Learning Methods**: The paper also discusses some new ML technologies such as Reinforcement Learning (RL) and Imitation Learning (IL), which have similarities to learning methods in human cognitive science. Overall, this paper aims to explore the nature and scientific status of ML from a philosophical and epistemological perspective, highlighting the strengths and limitations of current ML technologies and proposing directions for future research.