Light Coreference Resolution for Russian with Hierarchical Discourse Features

Elena Chistova,Ivan Smirnov

DOI: https://doi.org/10.28995/2075-7182-2023-22-34-41

2023-06-02

Abstract:Coreference resolution is the task of identifying and grouping mentions referring to the same real-world entity. Previous neural models have mainly focused on learning span representations and pairwise scores for coreference decisions. However, current methods do not explicitly capture the referential choice in the hierarchical discourse, an important factor in coreference resolution. In this study, we propose a new approach that incorporates rhetorical information into neural coreference resolution models. We collect rhetorical features from automated discourse parses and examine their impact. As a base model, we implement an end-to-end span-based coreference resolver using a partially fine-tuned multilingual entity-aware language model LUKE. We evaluate our method on the RuCoCo-23 Shared Task for coreference resolution in Russian. Our best model employing rhetorical distance between mentions has ranked 1st on the development set (74.6% F1) and 2nd on the test set (73.3% F1) of the Shared Task. We hope that our work will inspire further research on incorporating discourse information in neural coreference resolution models.

Computation and Language

What problem does this paper attempt to address?

The paper aims to address the problem of coreference resolution in Russian and attempts to introduce hierarchical discourse features into neural network models to improve the effectiveness of coreference resolution. Specifically, the authors propose a new method that integrates rhetorical information into the neural network's coreference resolution model. They collected rhetorical features from automated discourse parsing and evaluated the impact of these features on model performance. The benchmark model used in the study is an end-to-end span-based coreference resolver, partially fine-tuned with the multilingual entity-aware language model LUKE. The authors conducted experiments on the RuCoCo-2023 shared task dataset, and the results showed that the model using rhetorical distance features achieved the best performance on the development set (F1 score of 74.6%) and ranked second on the test set (F1 score of 73.3%). Additionally, the authors demonstrated that even with limited computational resources, the multilingual entity-aware LUKE model could compete with monolingual models dedicated to Russian in the coreference resolution task. The authors hope that their work will inspire more research on integrating discourse information into neural network coreference resolution models.

Light Coreference Resolution for Russian with Hierarchical Discourse Features

Improving Coreference Resolution by Leveraging Entity-Centric Features with Graph Neural Networks and Second-order Inference

Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUD

Online Coreference Resolution for Dialogue Processing: Improving Mention-Linking on Real-Time Conversations

Coreference Resolution without Span Representations

Coreference Resolution Based on High-Dimensional Multi-Scale Information

Coreferential Reasoning Learning for Language Representation

Coreference Resolution as Query-based Span Prediction

Visual Coreference Resolution in Visual Dialog using Neural Module Networks

End-to-End Neural Discourse Deixis Resolution in Dialogue

Parallel Data Helps Neural Entity Coreference Resolution

Link Type Based Pre-Cluster Pair Model for Coreference Resolution.

A Study on Improving End-to-End Neural Coreference Resolution.

Neural Models for Reasoning over Multiple Mentions Using Coreference

CorefQA: Coreference Resolution As Query-based Span Prediction.

RuCoCo: a new Russian corpus with coreference annotation

Jointly Optimized Neural Coreference Resolution with Mutual Attention

Neural Cross-Lingual Coreference Resolution and its Application to Entity Linking

Investigating the Role of Centering Theory in the Context of Neural Coreference Resolution Systems

Revealing the Myth of Higher-Order Inference in Coreference Resolution

Neural Discourse Modeling of Conversations