Abstract: Extracting entities and relations from text is a core task in information extraction. Existing extraction models often output their most confident predictions directly, without reconsidering or double-checking them, which leads to avoidable mistakes and sub-optimal performance. In this paper, we propose a novel coarse-to-fine extraction framework, which first extracts high-potential relations and entities via knowledge distillation, and then rechecks the predictions in a fine-grained manner via a handcrafted natural language inference (NLI) task. Specifically, based on the knowledge distillation mechanism, we train multiple teacher models iteratively with an adaptive loss function, so that each teacher concentrates more on the data that the others handle poorly. These complementary teacher models then provide soft-label information for training a considerate student model, enabling it to generate reliable preliminary predictions. Next, the predicted relations and entities are formulated as hypotheses and, together with the original sentences as premises, serve as the input to an NLI model. Considering the linguistic diversity of relational expressions, we automatically generate various semantic templates for the hypotheses through an $\mathcal{N}$-gram mining strategy. Moreover, because some sentences express multiple facts, we design a relation-guided Gaussian attention to reduce the gap between the single-relation hypothesis and the multi-relation premise. To make training efficient, we also develop several ways to generate high-quality negative samples, which help the NLI model learn to identify errors. Experimental results show that the proposed method is effective and outperforms other strong baselines on public benchmarks.
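The recheck step described in the abstract, where each preliminary prediction becomes a hypothesis paired with the source sentence as premise, can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the `Triple` type, the `TEMPLATES` table, the relation names, and the example sentence are hypothetical, and the hand-written templates merely stand in for the ones the paper mines automatically with its $\mathcal{N}$-gram strategy. The NLI model itself is not shown.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """A preliminary (head entity, relation, tail entity) prediction."""
    head: str
    relation: str
    tail: str


# Hypothetical hand-written templates; the paper instead mines such
# templates automatically with an N-gram strategy.
TEMPLATES = {
    "founded_by": [
        "{head} was founded by {tail}.",
        "{tail} is the founder of {head}.",
    ],
    "located_in": ["{head} is located in {tail}."],
}


def build_nli_inputs(sentence: str, triples: list[Triple]) -> list[tuple[str, str]]:
    """Pair the sentence (premise) with one hypothesis per template per triple."""
    pairs = []
    for triple in triples:
        # Relations without a template produce no hypothesis in this sketch.
        for template in TEMPLATES.get(triple.relation, []):
            hypothesis = template.format(head=triple.head, tail=triple.tail)
            pairs.append((sentence, hypothesis))
    return pairs


# Illustrative multi-fact sentence with two candidate predictions.
sentence = "Apple, headquartered in Cupertino, was founded by Steve Jobs."
candidates = [
    Triple("Apple", "founded_by", "Steve Jobs"),
    Triple("Apple", "located_in", "Cupertino"),
]
pairs = build_nli_inputs(sentence, candidates)
```

Each resulting (premise, hypothesis) pair would then be scored by an NLI model, and a prediction would be kept only if its hypotheses are judged to be entailed by the sentence. Using several templates per relation reflects the abstract's point about the linguistic diversity of relational expressions.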

A Knowledge Extraction Framework for Domain-Specific Application with Simplified Pre-Trained Language Model and Attention-Based Feature Extractor

A lattice LSTM-based framework for knowledge graph construction from power plants maintenance reports

Development and Evaluation of Task-Specific NLP Framework in China.

A Computational Framework for Effective Representation and Extraction of Knowledge Graph for Power Plant Maintenance and Overhaul.

Knowledge enhanced graph inference network based entity-relation extraction and knowledge graph construction for industrial domain

An Automatic and General Framework for Domain-Specific Knowledge Bases Extracting

Knowledge Automation Through Graph Mining, Convolution and Explanation Framework: A Soft Sensor Practice

Generalized knowledge-enhanced framework for biomedical entity and relation extraction

A ModelOps-based Framework for Intelligent Medical Knowledge Extraction

Research on Domain-Specific Knowledge Graph Based on the RoBERTa-wwm-ext Pretraining Model

A Framework Using Active Learning to Rapidly Perform Named Entity Extraction and Relation Recognition for Science and Technology Knowledge Graph

KICE: A Knowledge Consolidation and Expansion Framework for Relation Extraction.

Knowledge Extraction in Low-Resource Scenarios: Survey and Perspective

Named Entity Recognition in Equipment Support Field Using Tri-Training Algorithm and Text Information Extraction Technology

“FabNER”: information extraction from manufacturing process science domain literature using named entity recognition

DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Short Text Mining Framework with Specific Design for Operation and Maintenance of Power Equipment

A Named Entity Extraction Method for Commonly Used Steel Knowledge Graph

Integrating deep learning and multi-attention for joint extraction of entities and relationships in engineering consulting texts

A GPT-assisted iterative method for extracting domain knowledge from a large volume of literature of electromagnetic wave absorbing materials with limited manually annotated data

A Coarse-to-Fine Framework for Entity-Relation Joint Extraction.