Abstract:Background: Electronic Medical Record (EMR) comprises patients' medical information gathered by medical stuff for providing better health care. Named Entity Recognition (NER) is a sub-field of information extraction aimed at identifying specific entity terms such as disease, test, symptom, genes etc. NER can be a relief for healthcare providers and medical specialists to extract useful information automatically and avoid unnecessary and unrelated information in EMR. However, limited resources of available EMR pose a great challenge for mining entity terms. Therefore, a multitask bi-directional RNN model is proposed here as a potential solution of data augmentation to enhance NER performance with limited data. Methods: A multitask bi-directional RNN model is proposed for extracting entity terms from Chinese EMR. The proposed model can be divided into a shared layer and a task specific layer. Firstly, vector representation of each word is obtained as a concatenation of word embedding and character embedding. Then Bi-directional RNN is used to extract context information from sentence. After that, all these layers are shared by two different task layers, namely the parts-of-speech tagging task layer and the named entity recognition task layer. These two tasks layers are trained alternatively so that the knowledge learned from named entity recognition task can be enhanced by the knowledge gained from parts-of-speech tagging task. Results: The performance of our proposed model has been evaluated in terms of micro average F-score, macro average F-score and accuracy. It is observed that the proposed model outperforms the baseline model in all cases. For instance, experimental results conducted on the discharge summaries show that the micro average F-score and the macro average F-score are improved by 2.41% point and 4.16% point, respectively, and the overall accuracy is improved by 5.66% point. Conclusions: In this paper, a novel multitask bi-directional RNN model is proposed for improving the performance of named entity recognition in EMR. Evaluation results using real datasets demonstrate the effectiveness of the proposed model.

RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition

Improving Biomedical Named Entity Recognition with a Unified Multi-Task MRC Framework

How far is Language Model from 100% Few-shot Named Entity Recognition in Medical Domain

Demonstration-based learning for few-shot biomedical named entity recognition under machine reading comprehension

CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition

Biomedical named entity recognition using BERT in the machine reading comprehension framework

Long short-term memory RNN for biomedical named entity recognition

A multitask bi-directional RNN model for named entity recognition on Chinese electronic medical records

A Unified MRC Framework for Named Entity Recognition

Advancing entity recognition in biomedicine via instruction tuning of large language models

LLMs in Biomedicine: A study on clinical Named Entity Recognition

Inspire the Large Language Model by External Knowledge on BioMedical Named Entity Recognition

Empowering Biomedical Named Entity Recognition Through Multi-Tagger Collaboration

MMBERT: a unified framework for biomedical named entity recognition

A Knowledge-Enhanced Medical Named Entity Recognition Method that Integrates Pre-Trained Language Models

Large Language Models Struggle in Token-Level Clinical Named Entity Recognition

A pre-training and self-training approach for biomedical named entity recognition

Fighting Against the Repetitive Training and Sample Dependency Problem in Few-shot Named Entity Recognition

ANeTCM: A Novel MRC Framework for Traditional Chinese Medicine Named Entity Recognition