Constructing a Chinese Electronic Medical Record Corpus for Named Entity Recognition on Resident Admit Notes

Yan Gao,Lei Gu,Yefeng Wang,Yandong Wang,Feng Yang
DOI: https://doi.org/10.1186/s12911-019-0759-2
IF: 3.298
2019-01-01
BMC Medical Informatics and Decision Making
Abstract:BackgroundElectronic Medical Records(EMRs) contain much medical information about patients. Medical named entity extracting from EMRs can provide value information to support doctors' decision making. The research on information extraction of Chinese Electronic Medical Records is still behind that has done in English.MethodsThis paper proposed a practical annotation scheme for medical entity extraction on Resident Admit Notes (RANs), and a model which can automatic extract medical entity. Nine types of clinical entities, four types of clinical relationships were defined in our annotation scheme. An end-to-end deep neural network with convolution neural network and long-short term memory units was applied in our model for the medical named entity recognition(NER).ResultWe annotated RANs in three rounds. The overall F-score of annotation consistency was up to 97.73%. And our NER model on the 255 annotated RANs achieved the best F-score of 91.08%.ConclusionThe annotation scheme and the model for NER in this paper are effective to extract medical named entity from RANs and provide the basis for fully excavating the patient's information.
What problem does this paper attempt to address?