Protein-protein Interaction Relation Extraction Based on Multigranularity Semantic Fusion.
Yijing Li,Yanping Chen,Yongbin Qin,Ying Hu,Ruizhang Huang,Qinghua Zheng
DOI: https://doi.org/10.1016/j.jbi.2021.103931
IF: 8
2021-01-01
Journal of Biomedical Informatics
Abstract:Extracting semantic relationships about biomedical entities in a sentence is a typical task in biomedical information extraction. Because a sentence usually contains several named entities, it is important to learn global semantics of a sentence to support relation extraction. In related works, many strategies have been proposed to encode a sentence representation relevant to considered named entities. Despite the current success, according to the characteristic of languages, semantics of words are expressed on multigranular levels which also heavily depends on local semantic of a sentence. In this paper, we propose a multigranularity semantic fusion method to support biomedical relation extraction. In this method, Transformer is adopted for embedding words of a sentence into distributed representations, which is effective to encode global semantic of a sentence. Meanwhile, a multichannel strategy is applied to encode local semantics of words, which enables the same word to have different representations in a sentence. Both global and local semantic representations are fused to enhance the discriminability of the neural network. To evaluate our method, experiments are conducted on five standard PPI corpora (AImed, BioInfer, IEPA, HPRD50, and LLL), which achieve F1-scores of 83.4%, 89.9%, 81.2%, 84.5%, and 92.5%, respectively. The results show that multigranular semantic fusion is helpful to support the proteinprotein interaction relationship extraction.