Abstract:Extracting entities and relations from text is a significant task of information extraction. Existing extraction models often straightforwardly produce their confident prediction results without any reconsideration or double-checking, resulting in avoidable mistakes and sub-optimal performance. In this paper, we propose a novel coarse-to-fine extraction framework, which first extracts high-potential relations as well as entities via knowledge distillation, and then rechecks the predictions via handcrafted natural language inference (NLI) task in a fine-grained manner. Specifically, based on the knowledge distillation mechanism, we train multiple teacher models iteratively through an adaptive loss function for making one teacher concentrate more on the data that others are incompetent for. Then, these complementary teacher models are utilized to provide valuable soft-label information for training a considerate student model, enabling it to generate reliable preliminary predictions. Further, these generated potential relations and entities are formulated as hypotheses, together with the original sentences as premises, serving as the input for an NLI model. Considering the linguistic diversity of relational expression, we automatically generate various semantic templates for hypotheses through an <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\mathcal{N}$</tex> -gram mining strategy. Moreover, due to the existence of multi-fact sentences, a relation-guided Gaussian attention is designed to reduce the gap between the single-relation hypothesis and the multi-relation premise. To implement efficient training, we also develop several ways to generate high-quality negative samples, which help the NLI model learn to identify errors. Experimental results show that the proposed method is effective and outperforms other strong baselines on public benchmarks.

Entity Relation Extraction as Dependency Parsing in Visually Rich Documents

Entity-Relation Extraction As Multi-Turn Question Answering

A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents

Entity-Relation Extraction As Full Shallow Semantic Dependency Parsing

RE$^2$: Region-Aware Relation Extraction from Visually Rich Documents

Graph Convolution for Multimodal Information Extraction from Visually Rich Documents

Entity-centered Cross-document Relation Extraction

Discovering Medical Entity Relations from Texts using Dependency Information

Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents

VRDSynth: Synthesizing Programs for Multilingual Visually Rich Document Information Extraction

A Coarse-to-Fine Framework for Entity-Relation Joint Extraction.

TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents

Joint Extraction of Entities and Relations Based on a Novel Decomposition Strategy

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

DiVA-DocRE: A Discriminative and Voice-Aware Paradigm for Document-Level Relation Extraction

Performance of representation fusion model for entity and relationship extraction within unstructured text

Document-level Relation Extraction as Semantic Segmentation

Document-Level Relation Extraction with Entity Enhancement and Context Refinement

EMGE: Entities and Mentions Gradual Enhancement with semantics and connection modeling for document-level relation extraction

Document-level Relation Extraction via Separate Relation Representation and Logical Reasoning

Syntax-aware entity representations for neural relation extraction.