GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction

Urchade Zaratiana,Nadi Tomeh,Niama El Khbir,Pierre Holat,Thierry Charnois
2024-04-19
Abstract:Information extraction (IE) is an important task in Natural Language Processing (NLP), involving the extraction of named entities and their relationships from unstructured text. In this paper, we propose a novel approach to this task by formulating it as graph structure learning (GSL). By formulating IE as GSL, we enhance the model's ability to dynamically refine and optimize the graph structure during the extraction process. This formulation allows for better interaction and structure-informed decisions for entity and relation prediction, in contrast to previous models that have separate or untied predictions for these tasks. When compared against state-of-the-art baselines on joint entity and relation extraction benchmarks, our model, GraphER, achieves competitive results.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper attempts to address the issues of entity recognition and relation extraction in the Information Extraction (IE) task. Specifically, the authors propose a new method that models this task as Graph Structure Learning (GSL). Through this modeling approach, the model can dynamically optimize the graph structure during the extraction process, thereby better handling the prediction of entities and relations. Compared to traditional independent or separate prediction methods, this approach can better utilize graph structure information, improving the accuracy and consistency of predictions. ### Main Contributions 1. **New Modeling Paradigm**: Modeling the information extraction task as a graph structure learning problem enhances the model's dynamic optimization capability during the extraction process. 2. **Application of Graph Neural Networks**: Using Graph Neural Networks (GNN) to enrich the representation of nodes and edges and perform graph structure editing operations. 3. **Competitive Experimental Results**: The proposed model GraphER achieves performance comparable to or even better than existing state-of-the-art methods on multiple benchmark datasets. ### Method Overview 1. **Initial Graph Construction**: - **Span Representation**: Convert the input text into contextual embeddings and compute the representation of each span. - **Node Selection**: Select the top K spans with the highest scores as nodes by calculating the score of each span. - **Edge Selection**: Calculate the scores of potential edges and select the top K edges with the highest scores. 2. **Graph Structure Learning**: - **Graph Representation Learning**: Use Graph Neural Networks (specifically TokenGT) to enrich the representation of nodes and edges. - **Graph Editing**: Decide whether to retain these elements by calculating the retention probability of each node and edge, ultimately forming the final graph structure. 3. **Classification**: - **Node and Edge Classification**: Classify the nodes and edges in the final graph structure to determine their types. ### Experimental Results - **Main Results**: On the CoNLL 2004 and SciERC datasets, GraphER outperforms existing state-of-the-art methods, especially in the relation extraction task. - **Comparison with Message Passing GNNs**: Compared to traditional message passing Graph Neural Networks (such as GCN, GAT, and SAGE), GraphER performs significantly better in the relation extraction task, particularly on complex and noisy datasets. ### Conclusion By modeling the information extraction task as a graph structure learning problem, GraphER can better utilize graph structure information, improving the accuracy of entity and relation predictions. Experimental results show that this method is competitive on multiple benchmark datasets, especially excelling in handling complex and noisy data.