Abstract:Document-level Event Argument Extraction (EAE) faces two challenges due to increased input length: 1) difficulty in distinguishing semantic boundaries between events, and 2) interference from redundant information. To address these issues, we propose two methods. The first method introduces the Co and Structure Event Argument Extraction model (CsEAE) based on Small Language Models (SLMs). CsEAE includes a co-occurrences-aware module, which integrates information about all events present in the current input through context labeling and co-occurrences event prompts extraction. Additionally, CsEAE includes a structure-aware module that reduces interference from redundant information by establishing structural relationships between the sentence containing the trigger and other sentences in the document. The second method introduces new prompts to transform the extraction task into a generative task suitable for Large Language Models (LLMs), addressing gaps in EAE performance using LLMs under Supervised Fine-Tuning (SFT) conditions. We also fine-tuned multiple datasets to develop an LLM that performs better across most datasets. Finally, we applied insights from CsEAE to LLMs, achieving further performance improvements. This suggests that reliable insights validated on SLMs are also applicable to LLMs. We tested our models on the Rams, WikiEvents, and MLEE datasets. The CsEAE model achieved improvements of 2.1\%, 2.3\%, and 3.2\% in the Arg-C F1 metric compared to the baseline, PAIE~\cite{PAIE}. For LLMs, we demonstrated that their performance on document-level datasets is comparable to that of SLMs~\footnote{All code is available at <a class="link-external link-https" href="https://github.com/simon-p-j-r/CsEAE" rel="external noopener nofollow">this https URL</a>}.

STAR: Boosting Low-Resource Event Extraction by Structure-to-Text Data Generation with Large Language Models

STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models

Boosting Event Extraction with Denoised Structure-to-Text Augmentation

Struct-X: Enhancing Large Language Models Reasoning with Structured Data

Schema-based Data Augmentation for Event Extraction

Scale Up Event Extraction Learning via Automatic Training Data Generation

Agent-DA: Enhancing low-resource event extraction with collaborative multi-agent data augmentation

Automatically Labeled Data Generation for Large Scale Event Extraction

Improve Event Extraction via Self-Training with Gradient Guidance

Cascading Large Language Models for Salient Event Graph Generation

Is a Large Language Model a Good Annotator for Event Extraction?

A Structure-aware Generative Model for Biomedical Event Extraction

A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction

Low-Resource Event Extraction via Share-and-Transfer and Remaining Challenges

StrucText-Eval: Evaluating Large Language Model's Reasoning Ability in Structure-Rich Text

Structure-Aware Face Clustering on a Large-Scale Graph with 10(7) Nodes

Unified Text Structuralization with Instruction-tuned Language Models

Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning

Structure-Aware Face Clustering on a Large-Scale Graph With 107 Nodes.

STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery

One Small and One Large for Document-level Event Argument Extraction