LED: A Dataset for Life Event Extraction from Dialogs

Yi-Pei Chen,An-Zi Yen,Hen-Hsen Huang,Hideki Nakayama,Hsin-Hsi Chen
DOI: https://doi.org/10.48550/arXiv.2304.08327
2023-04-17
Abstract:Lifelogging has gained more attention due to its wide applications, such as personalized recommendations or memory assistance. The issues of collecting and extracting personal life events have emerged. People often share their life experiences with others through conversations. However, extracting life events from conversations is rarely explored. In this paper, we present Life Event Dialog, a dataset containing fine-grained life event annotations on conversational data. In addition, we initiate a novel conversational life event extraction task and differentiate the task from the public event extraction or the life event extraction from other sources like microblogs. We explore three information extraction (IE) frameworks to address the conversational life event extraction task: OpenIE, relation extraction, and event extraction. A comprehensive empirical analysis of the three baselines is established. The results suggest that the current event extraction model still struggles with extracting life events from human daily conversations. Our proposed life event dialog dataset and in-depth analysis of IE frameworks will facilitate future research on life event extraction from conversations.
Computation and Language
What problem does this paper attempt to address?
The paper aims to address the problem of extracting life events from conversations. Specifically, the researchers propose a dataset named Life Event Dialog (LED), which contains fine-grained life event annotations in dialogue data. Additionally, they introduce a new task—dialogue life event extraction—and distinguish it from existing tasks such as public event extraction or life event extraction from other sources like Weibo. By exploring three information extraction frameworks (OpenIE, relation extraction, and event extraction), the paper comprehensively evaluates the performance of these frameworks on the dialogue life event extraction task. The research findings indicate that current event extraction models still face challenges in extracting life events from everyday conversations. The contributions of the paper include the introduction of the LED dataset, the proposal of a new dialogue life event extraction task, and an in-depth analysis of information extraction frameworks. This work will facilitate future research on extracting life events from conversations.