Abstract:Extracting entities and relations from text is a significant task of information extraction. Existing extraction models often straightforwardly produce their confident prediction results without any reconsideration or double-checking, resulting in avoidable mistakes and sub-optimal performance. In this paper, we propose a novel coarse-to-fine extraction framework, which first extracts high-potential relations as well as entities via knowledge distillation, and then rechecks the predictions via handcrafted natural language inference (NLI) task in a fine-grained manner. Specifically, based on the knowledge distillation mechanism, we train multiple teacher models iteratively through an adaptive loss function for making one teacher concentrate more on the data that others are incompetent for. Then, these complementary teacher models are utilized to provide valuable soft-label information for training a considerate student model, enabling it to generate reliable preliminary predictions. Further, these generated potential relations and entities are formulated as hypotheses, together with the original sentences as premises, serving as the input for an NLI model. Considering the linguistic diversity of relational expression, we automatically generate various semantic templates for hypotheses through an <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\mathcal{N}$</tex> -gram mining strategy. Moreover, due to the existence of multi-fact sentences, a relation-guided Gaussian attention is designed to reduce the gap between the single-relation hypothesis and the multi-relation premise. To implement efficient training, we also develop several ways to generate high-quality negative samples, which help the NLI model learn to identify errors. Experimental results show that the proposed method is effective and outperforms other strong baselines on public benchmarks.

A Fine-Grained Network for Joint Multimodal Entity-Relation Extraction

Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Tagging

Joint Multimodal Entity-Relation Extraction Based on Temporal Enhancement and Similarity-Gated Attention

Dual-Gated Fusion with Prefix-Tuning for Multi-Modal Relation Extraction

Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model

A Joint Entity and Relation Extraction Approach Using Dilated Convolution and Context Fusion.

Enhancing Interaction Representation for Joint Entity and Relation Extraction.

Exploiting Visual Relation and Multi-Grained Knowledge for Multimodal Relation Extraction

Dual Interactive Attention Network for Joint Entity and Relation Extraction

Joint extraction of entities and relations via an entity correlated attention neural model

Joint Extraction of Entities and Relations Based on Hybrid Feature Representations

ERGM: A multi-stage joint entity and relation extraction with global entity match.

Entity relation joint extraction model combining pointer network and attention mechanism based on relative position embedding

Joint Entity and Relation Extraction Method Fused with Multiple Information

CAG: A Consistency-Adaptive Text-Image Alignment Generation for Joint Multimodal Entity-Relation Extraction

Enhancing Joint Entity and Relation Extraction with Language Modeling and Hierarchical Attention.

A Coarse-to-Fine Framework for Entity-Relation Joint Extraction.

Caption-Aware Multimodal Relation Extraction with Mutual Information Maximization

A Span-based Multi-Modal Attention Network for joint entity-relation extraction

Similarity-based Memory Enhanced Joint Entity and Relation Extraction