Abstract:Information extraction (IE) aims to extract meaningful structured tuples from unstructured text. Existing studies usually utilize a pre-trained generative language model that rephrases the original sentence into a target sequence, which can be easily decoded as tuples. However, traditional evaluation metrics treat a slight error within the tuple as an entire prediction failure, which is unable to perceive the correctness extent of a tuple. For this reason, we first propose a novel IE evaluation metric called Matching Score to evaluate the correctness of the predicted tuples in more detail. Moreover, previous works have ignored the effects of semantic uncertainty when focusing on the generation of the target sequence. We argue that leveraging the built-in semantic uncertainty of language models is beneficial for improving its robustness. In this work, we propose Binomial distribution guided counterpart sequence (BCS) method, which is a model-agnostic approach. Specifically, we propose to quantify the built-in semantic uncertainty of the language model by bridging all local uncertainties with the whole sequence. Subsequently, with the semantic uncertainty and Matching Score, we formulate a unique binomial distribution for each local decoding step. By sampling from this distribution, a counterpart sequence is obtained, which can be regarded as a semantic complement to the target sequence. Finally, we employ the Kullback-Leibler divergence to align the semantics of the target sequence and its counterpart. Extensive experiments on 14 public datasets over 5 information extraction tasks demonstrate the effectiveness of our approach on various methods. Our code and dataset are available at https://github.com/byinhao/BCS.

Set Learning for Generative Information Extraction

Unified Structure Generation for Universal Information Extraction

Sets2Sets

A Joint Learning Information Extraction Method Based on an Effective Inference Structure

Contrastive Information Extraction with Generative Transformer

LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model

Adaptive Ordered Information Extraction with Deep Reinforcement Learning

A Regularization-based Transfer Learning Method for Information Extraction via Instructed Graph Decoder

A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction

SetCSE: Set Operations using Contrastive Learning of Sentence Embeddings

One2Set: Generating Diverse Keyphrases As a Set

Towards Robust Information Extraction Via Binomial Distribution Guided Counterpart Sequence

Structured Language Generation Model for Robust Structure Prediction

SetGNER: General Named Entity Recognition As Entity Set Generation.

Efficient Data Learning for Open Information Extraction with Pre-trained Language Models

Two are Better than One: Joint Entity and Relation Extraction with Table-Sequence Encoders

Learning Structured Embeddings of Knowledge Graphs with Generative Adversarial Framework

Supervised structure learning

Unsupervised Structure Learning of Stochastic And-Or Grammars

Set-to-Sequence Methods in Machine Learning: a Review

CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors