Relation Extraction Based on Data Partition and Representation Integration

Jiapeng Zhao,Panpan Zhang,Tingwen Liu,Zhenyu Zhang,Yanzeng Li,Jinqiao Shi
DOI: https://doi.org/10.1109/dsc53577.2021.00017
2021-01-01
Abstract:Relation extraction (RE) is the cornerstone of many natural language processing applications. The success of machine learning algorithms generally depends on the embedding representation of data, since it involves different explanatory factors of variation behind the data. This paper explores integrating multiple representations to improve relation extraction. Note that the shortest dependency path (SDP) can retain relevant information and eliminate irrelevant words in the sentence, we partition the input dataset into multiple subsets based on the deformed SDP. For the input dataset and each subset, we train different encoders respectively. The input dataset encoder pays more attention to words in SDP, while the subset encoders pay more attention to words beyond SDP. In this way, we can get multiple embedding representations of diversity for the same sentence and improve the performance of RE by integrating them with predefined strategies. Experimental results on a widely used dataset demonstrate the effectiveness of our approach.
What problem does this paper attempt to address?