Large Language Models for Generative Information Extraction: A Survey

Derong Xu,Wei Chen,Wenjun Peng,Chao Zhang,Tong Xu,Xiangyu Zhao,Xian Wu,Yefeng Zheng,Yang Wang,Enhong Chen
2024-10-31
Abstract:Information extraction (IE) aims to extract structural knowledge from plain natural language texts. Recently, generative Large Language Models (LLMs) have demonstrated remarkable capabilities in text understanding and generation. As a result, numerous works have been proposed to integrate LLMs for IE tasks based on a generative paradigm. To conduct a comprehensive systematic review and exploration of LLM efforts for IE tasks, in this study, we survey the most recent advancements in this field. We first present an extensive overview by categorizing these works in terms of various IE subtasks and techniques, and then we empirically analyze the most advanced methods and discover the emerging trend of IE tasks with LLMs. Based on a thorough review conducted, we identify several insights in technique and promising research directions that deserve further exploration in future studies. We maintain a public repository and consistently update related works and resources on GitHub (\href{<a class="link-external link-https" href="https://github.com/quqxui/Awesome-LLM4IE-Papers" rel="external noopener nofollow">this https URL</a>}{LLM4IE repository})
Computation and Language
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address the issue of generating structured information in Information Extraction (IE) tasks using Large Language Models (LLMs). Specifically, the paper focuses on the following aspects: 1. **Limitations of Traditional Methods**: - Traditional IE methods require training independent models for each specific task, leading to high resource consumption and management costs. - Traditional methods perform poorly when dealing with complex and variable domain requirements. 2. **Advantages of LLMs**: - LLMs have powerful text understanding and generation capabilities, able to capture intrinsic patterns and semantic knowledge in text corpora through autoregressive prediction pre-training. - LLMs can perform zero-shot and few-shot learning, maintaining consistent performance across different tasks. - LLMs can act as intelligent agents, utilizing memory retrieval and various tools to improve task execution efficiency. 3. **Need for Generative IE Methods**: - Existing surveys have not fully explored the application of LLMs in generative IE, especially general frameworks and learning techniques in low-resource scenarios. - The community urgently needs a deeper analysis of how to more appropriately apply LLMs to IE tasks to enhance performance in the IE field. 4. **Challenges and Issues**: - There are alignment issues when converting natural language output into structured forms. - LLMs may experience hallucination problems, generating incorrect or irrelevant information. - LLMs are highly context-dependent, require significant computational resources, and have difficulties in updating internal knowledge. ### Main Contributions of the Paper - **Comprehensive Review**: The paper provides a comprehensive review of the latest advancements in LLMs for generative IE, covering different IE sub-tasks and techniques. - **Classification Systems**: Proposes two classification systems, one for categorizing different IE sub-tasks and another for categorizing new methods using LLMs. - **Performance Evaluation**: Conducts experimental analysis of representative methods, comparing their performance on different datasets. - **Future Directions**: Suggests potential research directions, providing guidance for future studies. Through these efforts, the paper aims to provide researchers with a comprehensive reference, promoting further development of LLMs in the field of generative IE.