A Syntax-enhanced Model Based on Category Keywords for Biomedical Relation Extraction

Xiaofeng Liu, Jiajie Tan, Jianye Fan, Kaiwen Tan, Jinlong Hu, Shoubin Dong
DOI: https://doi.org/10.1016/j.jbi.2022.104135
IF: 8
2022-01-01
Journal of Biomedical Informatics
Abstract: Some categories in multi-category biomedical relation extraction are linguistically similar. Category-related keywords and the syntactic structures of samples in these categories have distinctive features that are useful for biomedical relation extraction. Pre-trained models have been widely used and have achieved great success in biomedical relation extraction, but they still cannot mine this kind of information accurately. To address this problem, we present a syntax-enhanced model based on category keywords. First, we prune syntactic dependency trees according to category keywords obtained with the chi-square test. This pruning reduces the noise introduced by current syntactic parsing tools and retains useful category-related information. Next, to encode the category-related syntactic dependency trees, we present a syntactic transformer, which enhances the ability of the pre-trained model to capture syntactic structures and to distinguish among multiple categories. We evaluate our method on three biomedical datasets, where it outperforms state-of-the-art models. Further analysis verifies the effectiveness of our method.
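As a rough illustration of the keyword-selection step mentioned in the abstract, the sketch below scores each token against each relation category with a standard 2x2 chi-square statistic and keeps the top-scoring tokens. The abstract does not give the exact procedure, so the function name `chi_square_keywords`, the `top_k` parameter, the restriction to positively associated tokens, and the toy samples are all assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter, defaultdict


def chi_square_keywords(samples, top_k=2):
    """Return {label: [keywords]} ranked by a 2x2 chi-square score.

    samples: list of (tokens, label) pairs (hypothetical input format).
    """
    n = len(samples)
    label_counts = Counter(label for _, label in samples)
    term_counts = Counter()                    # samples containing the term
    term_label_counts = defaultdict(Counter)   # same, split by label
    for tokens, label in samples:
        for term in set(tokens):
            term_counts[term] += 1
            term_label_counts[term][label] += 1

    keywords = {}
    for label, n_label in label_counts.items():
        scores = {}
        for term, n_term in term_counts.items():
            a = term_label_counts[term][label]  # in category, has term
            b = n_term - a                      # other categories, has term
            c = n_label - a                     # in category, lacks term
            d = n - n_term - c                  # other categories, lacks term
            denom = (a + b) * (c + d) * (a + c) * (b + d)
            # Assumption: skip degenerate tables and keep only terms that
            # are positively associated with the category.
            if denom == 0 or a * d <= b * c:
                continue
            scores[term] = n * (a * d - b * c) ** 2 / denom
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        keywords[label] = ranked[:top_k]
    return keywords


# Toy, made-up samples (not taken from any of the paper's datasets).
toy_samples = [
    (["drug_a", "inhibits", "drug_b"], "inhibitor"),
    (["drug_a", "strongly", "inhibits", "drug_b"], "inhibitor"),
    (["drug_a", "activates", "drug_b"], "activator"),
    (["drug_a", "strongly", "activates", "drug_b"], "activator"),
]
toy_keywords = chi_square_keywords(toy_samples, top_k=1)
```

In a pipeline like the one described, keywords selected this way would then guide which parts of each dependency tree are kept during pruning; that step depends on parser output and is not sketched here.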