Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs

Fatemeh Shiri, Van Nguyen, Farhad Moghimifar, John Yoo, Gholamreza Haffari, Yuan-Fang Li
2024-06-03
Abstract: Large Language Models (LLMs) demonstrate significant capabilities in processing natural language data, promising efficient knowledge extraction from diverse textual sources to enhance situational awareness and support decision-making. However, concerns arise due to their susceptibility to hallucination, resulting in contextually inaccurate content. This work focuses on harnessing LLMs for automated Event Extraction, introducing a new method to address hallucination by decomposing the task into Event Detection and Event Argument Extraction. Moreover, the proposed method integrates dynamic schema-aware augmented retrieval examples into prompts tailored for each specific inquiry, thereby extending and adapting advanced prompting techniques such as Retrieval-Augmented Generation. Evaluation findings on prominent event extraction benchmarks and results from a synthesized benchmark illustrate the method's superior performance compared to baseline approaches.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses the hallucination problem of large language models (LLMs) in Event Extraction (EE), with the aim of making them more reliable for military decision support. The authors propose a method that decomposes event extraction into two subtasks: Event Detection (ED) and Event Argument Extraction (EAE). For each query, the prompt is enriched with dynamically retrieved, schema-aware examples, which reduces the amount of inaccurate or fabricated information the LLM generates. According to the authors, this improves the accuracy and relevance of the extracted events, and retrieval-augmented prompts are effective in both high-resource and low-resource settings. The research team also synthesized a new benchmark dataset, MaritimeEvent, and uses detailed analysis and case studies to show the importance of retrieval augmentation for event extraction performance.
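The decomposition described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the schema, the example pool, the token-overlap retriever, the prompt wording, and the final filter that drops arguments not found in the input text are all assumptions made for illustration, and `fake_llm` is a hypothetical stand-in for a real model call.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Hypothetical schema: event type -> argument roles (not taken from the paper).
SCHEMA: Dict[str, List[str]] = {
    "Attack": ["Attacker", "Target", "Place"],
    "Transport": ["Agent", "Artifact", "Destination"],
}

@dataclass
class Example:
    text: str
    event_type: str
    arguments: Dict[str, str]

# Hypothetical pool of annotated examples to retrieve demonstrations from.
EXAMPLES = [
    Example("Rebels attacked the convoy near the port.", "Attack",
            {"Attacker": "Rebels", "Target": "the convoy", "Place": "the port"}),
    Example("The ship carried fuel to the harbor.", "Transport",
            {"Agent": "The ship", "Artifact": "fuel", "Destination": "the harbor"}),
    Example("Pirates attacked a tanker off the coast.", "Attack",
            {"Attacker": "Pirates", "Target": "a tanker", "Place": "the coast"}),
]

def _tokens(text: str) -> set:
    return set(text.lower().replace(".", " ").replace(",", " ").split())

def similarity(a: str, b: str) -> float:
    """Jaccard token overlap, a toy substitute for an embedding-based retriever."""
    ta, tb = _tokens(a), _tokens(b)
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0

def retrieve(query: str, k: int, event_type: Optional[str] = None) -> List[Example]:
    """Return the k most similar examples, optionally restricted to one event type."""
    pool = [ex for ex in EXAMPLES if event_type is None or ex.event_type == event_type]
    return sorted(pool, key=lambda ex: similarity(query, ex.text), reverse=True)[:k]

def build_ed_prompt(text: str, k: int = 2) -> str:
    """Stage 1 (Event Detection): list the schema's event types plus retrieved demos."""
    parts = ["Event types: " + ", ".join(SCHEMA)]
    parts += [f"Text: {ex.text}\nEvent type: {ex.event_type}" for ex in retrieve(text, k)]
    parts.append(f"Text: {text}\nEvent type:")
    return "\n\n".join(parts)

def build_eae_prompt(text: str, event_type: str, k: int = 2) -> str:
    """Stage 2 (Event Argument Extraction): roles and demos for the detected type only."""
    parts = [f"Event type: {event_type}\nRoles: " + ", ".join(SCHEMA[event_type])]
    for ex in retrieve(text, k, event_type=event_type):
        args = "; ".join(f"{role}: {value}" for role, value in ex.arguments.items())
        parts.append(f"Text: {ex.text}\nArguments: {args}")
    parts.append(f"Text: {text}\nArguments:")
    return "\n\n".join(parts)

def extract(text: str, llm: Callable[[str], str]) -> Optional[dict]:
    """Run ED, then EAE; discard labels and arguments that are not grounded."""
    event_type = llm(build_ed_prompt(text)).strip()
    if event_type not in SCHEMA:  # reject event types outside the schema
        return None
    arguments = {}
    for line in llm(build_eae_prompt(text, event_type)).splitlines():
        if ":" not in line:
            continue
        role, value = (part.strip() for part in line.split(":", 1))
        # Assumed guard: keep only schema roles whose value occurs in the input text.
        if role in SCHEMA[event_type] and value and value.lower() in text.lower():
            arguments[role] = value
    return {"event_type": event_type, "arguments": arguments}

def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for a model call; it deliberately invents one argument."""
    if prompt.endswith("Event type:"):
        return "Attack"
    return "Attacker: Pirates\nTarget: a cargo vessel\nPlace: Mars"

if __name__ == "__main__":
    print(extract("Pirates attacked a cargo vessel near the strait.", fake_llm))
```

In this sketch the second prompt only ever sees roles and demonstrations for the event type detected in the first stage, which is one plausible reading of "schema-aware" retrieval. A real system would replace `similarity` with a proper retriever and `fake_llm` with an actual model client.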