LED: A Dataset for Life Event Extraction from Dialogs

Yi-Pei Chen,An-Zi Yen,Hen-Hsen Huang,Hideki Nakayama,Hsin-Hsi Chen

DOI: https://doi.org/10.48550/arXiv.2304.08327

2023-04-17

Abstract: Lifelogging has gained more attention due to its wide applications, such as personalized recommendations or memory assistance. The issues of collecting and extracting personal life events have emerged. People often share their life experiences with others through conversations. However, extracting life events from conversations is rarely explored. In this paper, we present Life Event Dialog, a dataset containing fine-grained life event annotations on conversational data. In addition, we initiate a novel conversational life event extraction task and differentiate the task from the public event extraction or the life event extraction from other sources like microblogs. We explore three information extraction (IE) frameworks to address the conversational life event extraction task: OpenIE, relation extraction, and event extraction. A comprehensive empirical analysis of the three baselines is established. The results suggest that the current event extraction model still struggles with extracting life events from human daily conversations. Our proposed life event dialog dataset and in-depth analysis of IE frameworks will facilitate future research on life event extraction from conversations.

Computation and Language

What problem does this paper attempt to address?

The paper addresses the problem of extracting life events from conversations. The authors present Life Event Dialog (LED), a dataset with fine-grained life event annotations on conversational data. They also introduce a new task, conversational life event extraction, and distinguish it from public event extraction and from life event extraction from other sources such as microblogs. They apply three information extraction (IE) frameworks (OpenIE, relation extraction, and event extraction) as baselines and analyze them empirically. The results suggest that current event extraction models still struggle to extract life events from everyday human conversations. The paper's contributions are the LED dataset, the new task, and the analysis of the IE frameworks, which the authors expect will facilitate future research on life event extraction from conversations.
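To make the difference between the three IE frameworks concrete, the sketch below shows the general shape of output each one typically produces for a single utterance. It is only an illustration: the utterance, the class names, the `Purchase` label, and the argument roles are all hypothetical and are not taken from the LED dataset or its annotation schema.

```python
from dataclasses import dataclass, field

# Hypothetical utterance for illustration; not an example from LED.
UTTERANCE = "I bought a new bike yesterday."


@dataclass
class OpenIETriple:
    """OpenIE: an open-vocabulary (subject, predicate, object) tuple of text spans."""
    subject: str
    predicate: str
    obj: str


@dataclass
class Relation:
    """Relation extraction: two mentions linked by a label from a fixed label set."""
    head: str
    tail: str
    label: str  # assumed label name, not from the paper


@dataclass
class Event:
    """Event extraction: a trigger, an event type, and role-labelled arguments."""
    trigger: str
    event_type: str  # assumed type name, not from the paper
    arguments: dict = field(default_factory=dict)


def spans_in_text(text, spans):
    """Return True if every span occurs verbatim in the text."""
    return all(span in text for span in spans)


openie = OpenIETriple(subject="I", predicate="bought", obj="a new bike")
relation = Relation(head="I", tail="a new bike", label="Purchase")
event = Event(
    trigger="bought",
    event_type="Purchase",
    arguments={"Buyer": "I", "Goods": "a new bike", "Time": "yesterday"},
)

if __name__ == "__main__":
    for result in (openie, relation, event):
        print(result)
```

In this sketch the OpenIE triple keeps the predicate as free text, the relation replaces it with a closed-set label, and the event adds a trigger and typed roles, so it can also carry the time expression that the other two representations leave out.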

