Abstract:Abstract Background Clinical named entity recognition (CNER) is important for medical information mining and establishment of high-quality knowledge map. Due to the different text features from natural language and a large number of professional and uncommon clinical terms in Chinese electronic medical records (EMRs), there are still many difficulties in clinical named entity recognition of Chinese EMRs. It is of great importance to eliminate semantic interference and improve the ability of autonomous learning of internal features of the model under the small training corpus. Methods From the perspective of deep learning, we integrated the attention mechanism into neural network, and proposed an improved clinical named entity recognition method for Chinese electronic medical records called BiLSTM-Att-CRF, which could capture more useful information of the context and avoid the problem of missing information caused by long-distance factors. In addition, medical dictionaries and part-of-speech (POS) features were also introduced to improve the performance of the model. Results Based on China Conference on Knowledge Graph and Semantic Computing (CCKS) 2017 and 2018 Chinese EMRs corpus, our BiLSTM-Att-CRF model finally achieved better performance than other widely-used models without additional features(F1-measure of 85.4% in CCKS 2018, F1-measure of 90.29% in CCKS 2017), and achieved the best performance with POS and dictionary features (F1-measure of 86.11% in CCKS 2018, F1-measure of 90.48% in CCKS 2017). In particular, the BiLSTM-Att-CRF model had significant effect on the improvement of Recall. Conclusions Our work preliminarily confirmed the validity of attention mechanism in discovering key information and mining text features, which might provide useful ideas for future research in clinical named entity recognition of Chinese electronic medical records. In the future, we will explore the deeper application of attention mechanism in neural network.

Named Entity Recognition for Long COVID Biomedical Literature by Using Bert-BiLSTM-IDCNN-ATT-CRF Approach

Named Entity Recognition from Biomedical Texts Using a Fusion Attention-Based BiLSTM-CRF.

Document-level Attention-Based BiLSTM-CRF Incorporating Disease Dictionary for Disease Named Entity Recognition

Biomedical Named Entity Recognition via A Hybrid Neural Network Model

Attention-Based LSTM Network for COVID-19 Clinical Trial Parsing

Disease Named Entity Recognition from Biomedical Literature Using a Novel Convolutional Neural Network

Long short-term memory RNN for biomedical named entity recognition

Research on Named Entity Recognition from Biomedical Literature

SBLC: a Hybrid Model for Disease Named Entity Recognition Based on Semantic Bidirectional LSTMs and Conditional Random Fields

An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records

Biomedical Named Entity Recognition with CNN-BLSTM-CRF

Study on Named Entity Recognition in Chinese Literatures on Hypertension treatment

An Attention-Based BiLSTM-CRF Model for Chinese Clinic Named Entity Recognition

Accurate Name Entity Recognition for Biomedical Literatures: A Combined High-quality Manual Annotation and Deep-learning Natural Language Processing Study

Fine-Grained Named Entity Recognition with Distant Supervision in COVID-19 Literature

Exploring Recurrent Neural Networks To Detect Named Entities From Biomedical Text

Medical Named Entity Recognition Fusing Part-of-Speech and Stroke Features

A BIGRU-Based Stacked Attention Network for Biomedical Named Entity Recognition with Chinese EMRs

HDCNN-CRF for Biomedical Text Named Entity Recognition

A Neural Framework for Chinese Medical Named Entity Recognition.

Chinese Clinical Named Entity Recognition Via Multi-Head Self-Attention Based BiLSTM-CRF