Abstract:Compositional generalization is the ability of a model to generalize to complex, previously unseen types of combinations of entities from just having seen the primitives. This type of generalization is particularly relevant to the semantic parsing community for applications such as task-oriented dialogue, text-to-SQL parsing, and information retrieval, as they can harbor infinite complexity. Despite the success of large language models (LLMs) in a wide range of NLP tasks, unlocking perfect compositional generalization still remains one of the few last unsolved frontiers. The past few years has seen a surge of interest in works that explore the limitations of, methods to improve, and evaluation metrics for compositional generalization capabilities of LLMs for semantic parsing tasks. In this work, we present a literature survey geared at synthesizing recent advances in analysis, methods, and evaluation schemes to offer a starting point for both practitioners and researchers in this area.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: how to achieve the compositional generalization ability in semantic parsing in large - scale language models (LLMs). Specifically, the paper focuses on how to enable these models to handle complex and previously unseen combinations of entities, rather than just being trained based on known basic elements. ### Problem Background Compositional generalization refers to the ability of a model to generalize to complex, previously unseen combinations of entities when it has only seen the basic elements. This type of generalization is especially important for the semantic parsing community because tasks such as task - oriented dialogue, text - to - SQL parsing, and information retrieval that it involves may have infinite complexity. Although large - scale language models have achieved success in a wide range of natural language processing tasks, they still face challenges in unlocking perfect compositional generalization. ### Paper Goals The main goal of the paper is to review the research progress in recent years on the compositional generalization ability of large - scale language models in semantic parsing tasks, including analysis methods, improvement methods, and evaluation metrics. By synthesizing these studies, it provides a starting point for practitioners and researchers to better understand and solve this problem. ### Main Problems 1. **Limitations of Existing Models**: Although large - scale language models perform well in standard benchmark tests, careful error analysis shows that these models perform poorly when dealing with symbolic forms containing nested structures, especially lacking generalization ability when the depth exceeds that of the training data. 2. **Crudeness of Evaluation Methods**: Existing benchmark corpora can only provide a rough assessment of the underlying capabilities of the model and cannot reflect its compositional generalization ability in detail. Therefore, the research community is committed to developing more targeted training - test splits for more fine - grained experiments. 3. **Definition and Evaluation of Compositional Generalization**: Different research works have different definitions and evaluation methods of compositional generalization, which makes it difficult to directly compare the results across different datasets. For this reason, researchers have proposed systematic methods to quantify the degree of compositional generalization. 4. **Methods for Improving Compositional Generalization**: The paper explores a variety of methods for improving compositional generalization, including data augmentation, neuro - symbolic modeling, and prompt - based techniques, etc. ### Conclusion The paper summarizes the current challenges of large - scale language models in compositional generalization and points out the directions for future research. Although these models perform well in many NLP tasks, compositional generalization is still a problem that needs to be urgently solved. By comprehensively analyzing existing research, the paper provides valuable references for researchers in the field of semantic parsing, helping them further explore and improve the methods and techniques in this field.

Towards Compositionally Generalizable Semantic Parsing in Large Language Models: A Survey

Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both?

A Survey on Semantic Parsing from the perspective of Compositionality

Explaining Large Language Model-Based Neural Semantic Parsers (Student Abstract)

Compositional Generalization in Multilingual Semantic Parsing over Wikidata

Evaluating Morphological Compositional Generalization in Large Language Models

Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability

Rethinking Semantic Parsing for Large Language Models: Enhancing LLM Performance with Semantic Hints

A Survey of Semantic Parsing Techniques

Compositionality as Lexical Symmetry

Compositional Generalization with Grounded Language Models

Large Language Models for Data Annotation and Synthesis: A Survey

Large Language Models Meet NLP: A Survey

Semantic Composition in Visually Grounded Language Models

A Survey on Compositional Learning of AI Models: Theoretical and Experimental Practices

A Survey of Large Language Models on Generative Graph Analytics: Query, Learning, and Applications

Improving Generalization in Semantic Parsing by Increasing Natural Language Variation

Efficient Large Language Models: A Survey

Analyzing the Role of Semantic Representations in the Era of Large Language Models

Learning from syntax generalizations for automatic semantic annotation

Conversational Semantic Parsing