Levels of AGI for Operationalizing Progress on the Path to AGI

Meredith Ringel Morris,Jascha Sohl-dickstein,Noah Fiedel,Tris Warkentin,Allan Dafoe,Aleksandra Faust,Clement Farabet,Shane Legg
2024-06-06
Abstract:We propose a framework for classifying the capabilities and behavior of Artificial General Intelligence (AGI) models and their precursors. This framework introduces levels of AGI performance, generality, and autonomy, providing a common language to compare models, assess risks, and measure progress along the path to AGI. To develop our framework, we analyze existing definitions of AGI, and distill six principles that a useful ontology for AGI should satisfy. With these principles in mind, we propose "Levels of AGI" based on depth (performance) and breadth (generality) of capabilities, and reflect on how current systems fit into this ontology. We discuss the challenging requirements for future benchmarks that quantify the behavior and capabilities of AGI models against these levels. Finally, we discuss how these levels of AGI interact with deployment considerations such as autonomy and risk, and emphasize the importance of carefully selecting Human-AI Interaction paradigms for responsible and safe deployment of highly capable AI systems.
Artificial Intelligence
What problem does this paper attempt to address?
This paper presents a framework for categorizing artificial intelligence (AI) models, particularly artificial general intelligence (AGI) and its precursors, in terms of their capabilities, behavior, and autonomy. The paper aims to provide a common language for comparing models, evaluating risks, and measuring progress towards AGI. The authors analyze existing definitions of AGI and extract six principles that should form the basis of a useful AGI taxonomy. They propose an "AGI level" based on the performance depth and breadth, and discuss how current systems fit into this taxonomy. Additionally, the paper discusses the challenges of future benchmarking to quantify the behavior and capabilities of AGI models, and explores the interaction between these AGI levels and deployment considerations such as autonomy and risk, emphasizing the importance of responsible and safe human-computer interaction paradigms for deploying highly capable AI systems.