Research on entity relation extraction for Chinese medical text

Yonghe Lu, Hongyu Chen, Yueyun Zhang, Jiahui Peng, Dingcheng Xiang, Jinxia Zhang
DOI: https://doi.org/10.1177/14604582241274762
2024-08-11
Health Informatics Journal
Abstract: Health Informatics Journal, Volume 30, Issue 3, July-September 2024. The primary challenges in entity relation extraction are currently overlapping relations and cascading errors. Both CasRel and TPLinker have proven competitive in addressing these issues. This study explores the application of these two models to entity relation extraction from Chinese medical text. We evaluate the models on the publicly available CMeIE dataset and further enhance them by incorporating pre-trained models tailored to the characteristics of the text. The experiments show that TPLinker benefits more strongly and more consistently from these pre-trained models than CasRel does, and that it achieves better performance with more advanced pre-trained models. Notably, the MacBERT + TPLinker combination is the best choice, surpassing the benchmark model by 12.45% and outperforming ERNIE-Health 3.0, the leading model in the CBLUE challenge, by 2.31%.
health care sciences & services, medical informatics
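
The overlapping-relation problem named in the abstract can be illustrated with a minimal sketch. It assumes relations are stored as (subject, relation, object) triples and uses the labels Normal, SEO (single entity overlap) and EPO (entity pair overlap), which are common in the relation-extraction literature but do not appear in the abstract itself. The function name `overlap_types` and the example triples are hypothetical and are not taken from CMeIE.

```python
from itertools import combinations
from typing import List, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)


def overlap_types(triples: List[Triple]) -> Set[str]:
    """Return the overlap patterns found among the distinct triples of one sentence."""
    found: Set[str] = set()
    for (s1, _, o1), (s2, _, o2) in combinations(triples, 2):
        pair1, pair2 = {s1, o1}, {s2, o2}
        if pair1 == pair2:
            # The same two entities are linked by more than one relation.
            found.add("EPO")
        elif pair1 & pair2:
            # The triples share an entity but do not link the same pair.
            found.add("SEO")
    # No shared entities in any pair of triples: the sentence is "Normal".
    return found or {"Normal"}


# Hypothetical example: one disease entity takes part in two relations.
example = [
    ("diabetes", "symptom", "thirst"),
    ("diabetes", "drug_therapy", "metformin"),
]
print(overlap_types(example))
```

In this example the two triples share the subject "diabetes", so the sentence is labelled SEO. A sentence can carry both labels when some triples share a single entity and others link the same entity pair.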