Tackling the Abstraction and Reasoning Corpus (ARC) with Object-centric Models and the MDL Principle

Sébastien Ferré
2023-11-01
Abstract:The Abstraction and Reasoning Corpus (ARC) is a challenging benchmark, introduced to foster AI research towards human-level intelligence. It is a collection of unique tasks about generating colored grids, specified by a few examples only. In contrast to the transformation-based programs of existing work, we introduce object-centric models that are in line with the natural programs produced by humans. Our models can not only perform predictions, but also provide joint descriptions for input/output pairs. The Minimum Description Length (MDL) principle is used to efficiently search the large model space. A diverse range of tasks are solved, and the learned models are similar to the natural programs. We demonstrate the generality of our approach by applying it to a different domain.
Artificial Intelligence
What problem does this paper attempt to address?
The paper attempts to address the problem of improving the performance of AI systems on abstract reasoning tasks in the Abstraction and Reasoning Corpus (ARC) benchmark by using object-centric models and the Minimum Description Length (MDL) principle. Specifically, ARC is a challenging benchmark set designed to promote AI research towards human-level intelligence. Existing methods are primarily based on transformation programs, whereas this paper proposes a new approach, namely object-centric models, which are closer to human-generated programs and can not only make predictions but also provide joint descriptions of input-output pairs. The contributions proposed by the authors include: 1. **Object-Centric Models**: These models can parse and generate grids centered on object patterns and computations on objects, which is more aligned with human problem-solving thinking. 2. **Efficient Search under the MDL Principle**: Utilizing the MDL principle to efficiently search the large space of object-centric models, thereby finding the models that best compress the data. The paper demonstrates the effectiveness of this approach across various tasks and proves its generality, such as its application to the task of automatically filling spreadsheet columns. This approach achieves better results on ARC tasks compared to existing transformation-based methods and is closer to human-generated programs.