Abstract:Extracting entities and relations from text is a significant task of information extraction. Existing extraction models often straightforwardly produce their confident prediction results without any reconsideration or double-checking, resulting in avoidable mistakes and sub-optimal performance. In this paper, we propose a novel coarse-to-fine extraction framework, which first extracts high-potential relations as well as entities via knowledge distillation, and then rechecks the predictions via handcrafted natural language inference (NLI) task in a fine-grained manner. Specifically, based on the knowledge distillation mechanism, we train multiple teacher models iteratively through an adaptive loss function for making one teacher concentrate more on the data that others are incompetent for. Then, these complementary teacher models are utilized to provide valuable soft-label information for training a considerate student model, enabling it to generate reliable preliminary predictions. Further, these generated potential relations and entities are formulated as hypotheses, together with the original sentences as premises, serving as the input for an NLI model. Considering the linguistic diversity of relational expression, we automatically generate various semantic templates for hypotheses through an <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\mathcal{N}$</tex> -gram mining strategy. Moreover, due to the existence of multi-fact sentences, a relation-guided Gaussian attention is designed to reduce the gap between the single-relation hypothesis and the multi-relation premise. To implement efficient training, we also develop several ways to generate high-quality negative samples, which help the NLI model learn to identify errors. Experimental results show that the proposed method is effective and outperforms other strong baselines on public benchmarks.

Extending Dictionary-Based Entity Extraction to Tolerate Errors.

An Efficient Trie-based Method for Approximate Entity Extraction with Edit-Distance Constraints

A Unified Framework for Approximate Dictionary-Based Entity Extraction.

Faerie: efficient filtering algorithms for approximate dictionary-based entity extraction.

2ED: An Efficient Entity Extraction Algorithm Using Two-Level Edit-Distance

Entity-Relation Extraction As Multi-Turn Question Answering

Boosting approximate dictionary-based entity extraction with synonyms

Reserch of Entity Matching Based on Multiple Heterogenous Data

A Technical Report: Entity Extraction Using Both Character-based and Token-based Similarity

A Coarse-to-Fine Framework for Entity-Relation Joint Extraction.

EXACT: Attributed Entity Extraction By Annotating Texts

Entity disambiguation with context awareness in user-generated short texts

Error-Tolerant Big Data Processing

A New Entity Extraction Method Based on Machine Reading Comprehension

A multiple head selection joint entity-relation extraction model

Extract and Attend: Improving Entity Translation in Neural Machine Translation

A Cascade Dual-Decoder Model for Joint Entity and Relation Extraction

Entity Disambiguation via Fusion Entity Decoding

AML: Efficient Approximate Membership Localization within a Web-Based Join Framework

DABC: A Named Entity Recognition Method Incorporating Attention Mechanisms

An Entity-Relation Joint Extraction Method Based on Two Independent Sub-Modules From Unstructured Text