Neural networks for abstraction and reasoning: Towards broad generalization in machines

Mikel Bober-Irizar,Soumya Banerjee
2024-02-06
Abstract:For half a century, artificial intelligence research has attempted to reproduce the human qualities of abstraction and reasoning - creating computer systems that can learn new concepts from a minimal set of examples, in settings where humans find this easy. While specific neural networks are able to solve an impressive range of problems, broad generalisation to situations outside their training data has proved
Artificial Intelligence,Computation and Language,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the limitations of current artificial intelligence systems in abstraction and reasoning abilities, especially the lack of broad generalization ability of these systems when facing new situations outside the training data. Specifically, although neural networks can solve various specific tasks, in abstract visual reasoning tasks (such as Abstraction & Reasoning Corpus, ARC), existing algorithms are still difficult to achieve broad generalization, that is, the ability to learn new concepts from a small number of examples and apply these concepts in different situations. ARC is a data set containing abstract visual reasoning tasks, aiming to test the broad generalization ability of algorithms. Although the international competition has a prize of up to $100,000, the best algorithm still cannot solve most ARC tasks and relies on complex hand - made rules without using machine - learning methods. For this reason, the authors explored two new directions to solve ARC tasks: 1. **Adaptation of the DreamCoder algorithm**: The authors adapted the DreamCoder algorithm, a recent advanced algorithm in the field of program induction, to solve ARC tasks. They designed a pure functional language named Perceptual Abstraction & Reasoning Language (PeARL), enabling DreamCoder to solve ARC tasks, and proposed a new recognition model, which significantly improved the performance of the previous best implementation. 2. **Application of large - language models**: The authors introduced a new framework for using large - language models (LLMs) to solve ARC tasks, transforming these visual tasks into the text domain. Through detailed evaluation of the performance of three model categories, it was found that LLMs can achieve performance competitive with human - made systems after appropriate data augmentation and domain transformation. In addition, the authors also constructed an integration of multiple ARC solvers, considering heterogeneous performance. This integration method performs better than the current state - of - the - art single system. Finally, the authors released the arckit Python library to promote future research in this field. In conclusion, this paper aims to improve the broad generalization ability of computers in abstraction and reasoning tasks by exploring new machine - learning systems, especially methods combined with neural networks, so as to be closer to the human intelligence level.