Abstract:Information extraction (IE) is an important task in Natural Language Processing (NLP), involving the extraction of named entities and their relationships from unstructured text. In this paper, we propose a novel approach to this task by formulating it as graph structure learning (GSL). By formulating IE as GSL, we enhance the model's ability to dynamically refine and optimize the graph structure during the extraction process. This formulation allows for better interaction and structure-informed decisions for entity and relation prediction, in contrast to previous models that have separate or untied predictions for these tasks. When compared against state-of-the-art baselines on joint entity and relation extraction benchmarks, our model, GraphER, achieves competitive results.

What problem does this paper attempt to address?

### Problems Addressed by the Paper The paper attempts to address the issues of entity recognition and relation extraction in the Information Extraction (IE) task. Specifically, the authors propose a new method that models this task as Graph Structure Learning (GSL). Through this modeling approach, the model can dynamically optimize the graph structure during the extraction process, thereby better handling the prediction of entities and relations. Compared to traditional independent or separate prediction methods, this approach can better utilize graph structure information, improving the accuracy and consistency of predictions. ### Main Contributions 1. **New Modeling Paradigm**: Modeling the information extraction task as a graph structure learning problem enhances the model's dynamic optimization capability during the extraction process. 2. **Application of Graph Neural Networks**: Using Graph Neural Networks (GNN) to enrich the representation of nodes and edges and perform graph structure editing operations. 3. **Competitive Experimental Results**: The proposed model GraphER achieves performance comparable to or even better than existing state-of-the-art methods on multiple benchmark datasets. ### Method Overview 1. **Initial Graph Construction**: - **Span Representation**: Convert the input text into contextual embeddings and compute the representation of each span. - **Node Selection**: Select the top K spans with the highest scores as nodes by calculating the score of each span. - **Edge Selection**: Calculate the scores of potential edges and select the top K edges with the highest scores. 2. **Graph Structure Learning**: - **Graph Representation Learning**: Use Graph Neural Networks (specifically TokenGT) to enrich the representation of nodes and edges. - **Graph Editing**: Decide whether to retain these elements by calculating the retention probability of each node and edge, ultimately forming the final graph structure. 3. **Classification**: - **Node and Edge Classification**: Classify the nodes and edges in the final graph structure to determine their types. ### Experimental Results - **Main Results**: On the CoNLL 2004 and SciERC datasets, GraphER outperforms existing state-of-the-art methods, especially in the relation extraction task. - **Comparison with Message Passing GNNs**: Compared to traditional message passing Graph Neural Networks (such as GCN, GAT, and SAGE), GraphER performs significantly better in the relation extraction task, particularly on complex and noisy datasets. ### Conclusion By modeling the information extraction task as a graph structure learning problem, GraphER can better utilize graph structure information, improving the accuracy of entity and relation predictions. Experimental results show that this method is competitive on multiple benchmark datasets, especially excelling in handling complex and noisy data.

GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction

PGLR: Pseudo Graph and Label Reuse for Entity Relation Extraction

JSEEGraph: Joint Structured Event Extraction as Graph Parsing

An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction

GraphIE: A Graph-Based Framework for Information Extraction

GraphERE: Jointly Multiple Event-Event Relation Extraction via Graph-Enhanced Event Embeddings

Entity-Relation Extraction As Full Shallow Semantic Dependency Parsing

Leveraging Knowledge Graph Embeddings to Enhance Contextual Representations for Relation Extraction

GEGA: Graph Convolutional Networks and Evidence Retrieval Guided Attention for Enhanced Document-level Relation Extraction

EMGE: Entities and Mentions Gradual Enhancement with semantics and connection modeling for document-level relation extraction

EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction

Graphene: Semantically-Linked Propositions in Open Information Extraction

Entity-Aware Self-Attention and Contextualized GCN for Enhanced Relation Extraction in Long Sentences

ERDERP: Entity and Relation Double Embedding on Relation Hyperplanes and Relation Projection Hyperplanes

A Coarse-to-Fine Framework for Entity-Relation Joint Extraction.

Entity Resolution with Hierarchical Graph Attention Networks

Distantly Supervised Morpho-Syntactic Model for Relation Extraction

Graph-based Reasoning Model for Multiple Relation Extraction.

End-to-end Relation Extraction Using Graph Convolutional Network with a Novel Entity Attention

Text-Graph Enhanced Knowledge Graph Representation Learning.

Graph-Augmented Relation Extraction Model with LLMs-Generated Support Document