Abstract: Compositionality has long been considered a key explanatory property underlying human intelligence: arbitrary concepts can be composed into novel complex combinations, permitting the acquisition of an open-ended, potentially infinite expressive capacity from finite learning experiences. Influential arguments have held that neural networks fail to explain this aspect of behavior, leading many to dismiss them as viable models of human cognition. Over the last decade, however, modern deep neural networks (DNNs), which share the same fundamental design principles as their predecessors, have come to dominate artificial intelligence, exhibiting the most advanced cognitive behaviors ever demonstrated in machines. In particular, large language models (LLMs), DNNs trained to predict the next word on a large corpus of text, have proven capable of sophisticated behaviors such as writing syntactically complex sentences without grammatical errors, producing cogent chains of reasoning, and even writing original computer programs, all behaviors thought to require compositional processing. In this chapter, we survey recent empirical work from machine learning for a broad audience in philosophy, cognitive science, and neuroscience, situating recent breakthroughs within the broader context of philosophical arguments about compositionality. In particular, our review emphasizes two approaches to endowing neural networks with compositional generalization capabilities: (1) architectural inductive biases, and (2) metalearning, or learning to learn. We also present findings suggesting that LLM pretraining can be understood as a kind of metalearning, and can thereby equip DNNs with compositional generalization abilities in a similar way. We conclude by discussing the implications that these findings may have for the study of compositionality in human cognition and by suggesting avenues for future research.

Development of Compositionality and Generalization through Interactive Learning of Language and Action of Robots

Compositional learning of functions in humans and machines

Exploring the Effect of Primitives for Compositional Generalization in Vision-and-Language

Compositional generalization through abstract representations in human and artificial neural networks

Programmatically Grounded, Compositionally Generalizable Robotic Manipulation

Exploring the acquisition and production of grammatical constructions through human-robot interaction with echo state networks

Compositional Learning of Human Activities With a Self-Organizing Neural Architecture

From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks

Compositionality and Generalization in Emergent Languages

Natural language instructions induce compositional generalization in networks of neurons

Learning Neuro-symbolic Programs for Language Guided Robot Manipulation

$π_0$: A Vision-Language-Action Flow Model for General Robot Control

Improving Compositional Generalization Using Iterated Learning and Simplicial Embeddings

Learning and Compositionality: a Unification Attempt via Connectionist Probabilistic Programming

Enhancing Interpretability and Interactivity in Robot Manipulation: A Neurosymbolic Approach

Interactive Robot Learning of Gestures, Language and Affordances

Compositional Generalization by Learning Analytical Expressions

A Study of Compositional Generalization in Neural Models

A Survey on Compositional Learning of AI Models: Theoretical and Experimental Practices

Iterated Learning Improves Compositionality in Large Vision-Language Models

On the Correspondence between Compositionality and Imitation in Emergent Neural Communication