Named Entity Extraction for Chinese Electronic Medical Records.

Hongjie Fan,Dongsheng Wang,Songtao Ye
DOI: https://doi.org/10.1145/3374587.3374612
2019-01-01
Abstract:Named entity extraction task refers to identifying and extracting proper named entities from natural language texts. It is the key task in knowledge graph construction. Disease, symptom and drug entities are widely distributed in Chinese electronic medical records (EMRs). Extracting high-quality medical entities from EMR plays an important role in building medical knowledge graph, medical question & answer and assistance decision making. For the widely distributed entities, in this paper, we propose an end-to-end named entity extraction framework, which uses popular deep learning based approach, known as conditional random field (CRF), bidirectional-long short-term memory (Bi-LSTM+CRF) and BERT+Bi-LSTM+CRF for training and testing the named entities. These models are tested on real medical records, and the experimental results show that the method can effectively identify the entities, and has certain practical value.
What problem does this paper attempt to address?