MRC-based Nested Medical NER with Co-prediction and Adaptive Pre-training

Xiaojing Du, Hanjie Zhao, Danyan Xing, Yuxiang Jia, Hongying Zan
2024-03-23
Abstract: In medical information extraction, medical Named Entity Recognition (NER) is indispensable, playing a crucial role in developing medical knowledge graphs, enhancing medical question-answering systems, and analyzing electronic medical records. The challenge in medical NER arises from complex nested structures and sophisticated medical terminology, which distinguish it from NER in traditional domains. In response to these complexities, we propose a medical NER model based on Machine Reading Comprehension (MRC), which uses a task-adaptive pre-training strategy to improve the model's capability in the medical field. Our model also introduces multiple word-pair embeddings and multi-granularity dilated convolution to enhance its representation ability, and uses a combined Biaffine and MLP predictor to improve recognition performance. Experimental evaluations on CMeEE, a benchmark for Chinese nested medical NER, demonstrate that our proposed model outperforms the compared state-of-the-art (SOTA) models.
Computation and Language
What problem does this paper attempt to address?
The paper addresses Named Entity Recognition (NER) in the medical field, in particular the challenges posed by complex nested structures and specialized medical terminology. It proposes a medical NER model based on Machine Reading Comprehension (MRC) and improves recognition performance through co-prediction and adaptive pre-training. Specifically, the paper addresses the following key issues:

1. **Handling Nested Structures**: Entities in medical texts are often nested, with one entity containing another. Traditional NER methods struggle with such structures. Following earlier studies, the authors recast the NER task as an MRC task to better handle nested entities.
2. **Utilization of Entity Types**: To make better use of entity type information, the model introduces Conditional Layer Normalization (CLN) and embeds entity types within it to enhance the model's expressiveness.
3. **Multi-granularity Information Fusion**: To capture interactions between words at different distances, the model employs multi-granularity dilated convolution. It also introduces several word-pair embeddings (such as distance embeddings and region embeddings) and a co-prediction mechanism that combines Biaffine and MLP predictors to further improve performance.
4. **Adaptive Pre-training Strategy**: To adapt the model to the medical field, the paper proposes a task-adaptive pre-training strategy that further tunes the original pre-trained model, making it more suitable for medical NER.

In summary, the research aims to improve the accuracy of named entity recognition in medical texts through these techniques, with particular gains on nested entities. Experimental results show that the proposed model outperforms the compared state-of-the-art models on the CMeEE benchmark.
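To make the MRC reformulation in point 1 more concrete, the following is a minimal sketch of how nested annotations might be split into one question per entity type, so that overlapping entities no longer compete for the same token labels. It is an illustration under stated assumptions, not the authors' implementation: the query wording, the English example sentence, and the helper names `to_mrc_examples` and `span_grid` are all hypothetical, and the summary does not specify how the paper builds its queries or labels.

```python
from collections import defaultdict

# Hypothetical natural-language queries, one per entity type.
# The type codes and wording are illustrative assumptions only.
TYPE_QUERIES = {
    "dis": "Find all disease mentions in the text.",
    "bod": "Find all body part mentions in the text.",
}


def to_mrc_examples(tokens, entities, type_queries=TYPE_QUERIES):
    """Build one (query, context, spans) example per entity type.

    entities: list of (start, end, type) with inclusive token indices.
    Entities of different types may overlap or nest freely, because
    each type is answered by its own query.
    """
    spans_by_type = defaultdict(list)
    for start, end, etype in entities:
        spans_by_type[etype].append((start, end))

    examples = []
    for etype, query in type_queries.items():
        examples.append({
            "type": etype,
            "query": query,
            "context": tokens,
            "spans": sorted(spans_by_type.get(etype, [])),
        })
    return examples


def span_grid(n_tokens, spans):
    """Mark each (start, end) span in an n x n word-pair grid.

    A grid cell per word pair also lets same-type nested spans coexist,
    since each span occupies a distinct cell.
    """
    grid = [[0] * n_tokens for _ in range(n_tokens)]
    for start, end in spans:
        grid[start][end] = 1
    return grid


# Toy nested example: a body part nested inside a disease mention.
tokens = ["chronic", "kidney", "disease"]
entities = [(0, 2, "dis"), (1, 1, "bod")]

examples = to_mrc_examples(tokens, entities)
for ex in examples:
    print(ex["type"], ex["spans"], span_grid(len(tokens), ex["spans"]))
```

In this toy case the "dis" query is answered by the whole three-token span while the "bod" query is answered by the single inner token, so the nesting is resolved by asking separate questions rather than by a single flat tag sequence. A word-pair predictor such as the Biaffine and MLP combination described above would then score each grid cell; that scoring step is not shown here.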