Named Entity Extraction of Traditional Chinese Medicine Medical Records Based on Conditional Random Field

Liu Kai,Zhou Xuezhong,Yu Jian,Zhang Runshun
DOI: https://doi.org/10.3969/j.issn.1000-3428.2014.09.062
2014-01-01
Abstract:Traditional Chinese Medicine( TCM) medical records are the important data resources of the TCM medical research. The main form of them is still text now,and it is necessary to extract the structured information from the medical records,while named entity extraction is the basic step. It makes 413 copies of manually labeled medical records in Chinese text and four types of feature templates to study about the named entity extraction practice such as symptoms, diseases and incentives. It compares the results of TCM medical records named entity extraction by Conditional Random Field( CRF ) , Hidden Markov Model ( HMM ) and Maximum Entropy Markov Model ( MEMM ) . Combined with appropriate feature templates,CRF has well performance of F1:symptoms 0. 80,the name of the disease 0. 74,incentives 0. 74. Compared with HMM and MEMM,CRF has the highest precision and recall rate. This preliminary shows that CRF is an applicable method of the Chinese medical records named entity extraction.
What problem does this paper attempt to address?