Human-level few-shot concept induction through minimax entropy learning

Chi Zhang,Baoxiong Jia,Yixin Zhu,Song-Chun Zhu
DOI: https://doi.org/10.1126/sciadv.adg2488
IF: 13.6
2024-04-20
Science Advances
Abstract:Humans learn concepts both from labeled supervision and by unsupervised observation of patterns, a process machines are being taught to mimic by training on large annotated datasets—a method quite different from the human pathway, wherein few examples with no supervision suffice to induce an unfamiliar relational concept. We introduce a computational model designed to emulate human inductive reasoning on abstract reasoning tasks, such as those in IQ tests, using a minimax entropy approach. This method combines identifying the most effective constraints on data via minimum entropy with determining the best combination of them via maximum entropy. Our model, which applies this unsupervised technique, induces concepts from just one instance, reaching human-level performance on tasks of Raven's Progressive Matrices (RPM), Machine Number Sense (MNS), and Odd-One-Out (O 3 ). These results demonstrate the potential of minimax entropy learning for enabling machines to learn relational concepts efficiently with minimal input.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper primarily focuses on addressing the issue of concept induction in artificial intelligence systems, specifically how to enable machines to efficiently learn abstract and relational concepts from a small number of examples, similar to humans. Specifically, the goals of the paper include: 1. **Imitating Human Learning Methods**: Unlike current machine learning methods that require large annotated datasets for training, the proposed method aims to mimic the human ability to learn new concepts through a small number of unsupervised observations. 2. **Improving Learning Efficiency**: The research focuses on enhancing the learning efficiency of machine learning algorithms, enabling machines to learn new concepts from just one instance, similar to humans. 3. **Solving Specific Tasks**: The paper highlights three specific tasks—Raven's Progressive Matrices (RPM), Machine Number Sense (MNS), and Odd-One-Out (O3), which are commonly used to measure intelligence levels. The goal is to enable machines to achieve human-level performance on these tasks. 4. **Proposing a New Learning Framework**: To achieve the above goals, the paper introduces a computational model based on "minimax entropy learning," which can identify the most effective constraints from limited data and determine the optimal combination to form an understanding of hidden concepts. In summary, the core contribution of the paper is the proposal of a new learning framework that can significantly improve the performance of machines in handling abstract reasoning tasks, especially in situations with limited data, bringing machines closer to human learning capabilities in concept induction.