EMGE: Entities and Mentions Gradual Enhancement with semantics and connection modeling for document-level relation extraction

Guojun Chen,Panfeng Chen,Qi Wang,Hui Li,Xin Zhou,Xibin Wang,Aihua Yu,Xingzhi Deng
DOI: https://doi.org/10.1016/j.knosys.2024.112777
IF: 8.139
2024-11-30
Knowledge-Based Systems
Abstract:Relation extraction is the process of identifying connections between entities in unstructured text and is a critical component of entity-centred information extraction to uncover latent knowledge structures in complex documents. Although graph-based methods have pushed the state-of-the-art forward in relation extraction, current approaches still exhibit limitations. These include incomplete capture of graph structural features, inadequate modeling of long-distance dependencies and imprecise representation of complex entity interactions. A novel E ntities and M entions G radual E nhancement framework called EMGE is proposed. It integrates both contextual and structural information to robustly enhance entity representations for document-level relation extraction. It comprises three primary components: 1) a dynamic relation aware enhancement mechanism to comprehensively encode graph structural features; 2) a multi-scale feature enhancement module to effectively capture long-distance dependencies; and 3) an entity mention-pair enhancement mechanism to yield precise representations of classification targets. Extensive empirical evaluation on five widely-adopted datasets demonstrates that EMGE achieves promising performance. Particularly noteworthy are the substantial gains obtained on the challenging CDR dataset, where EMGE achieved relative improvements of 1.7%, 11.4%, and 4.1% over the strongest baseline in terms of the Intra-F1, Inter-F1 and Overall-F1 metrics, respectively. Further experimental results demonstrate that the proposed model outperforms the popular large language model in relation extraction tasks. Our code is available on github. 1
computer science, artificial intelligence
What problem does this paper attempt to address?