Chronic Disease Related Entity Extraction in Online Chinese Question and Answer Services.

Yan Zhang, Yong Zhang, Yanshen Yin, Jennifer Xu, Chunxiao Xing, Hsinchun Chen
DOI: https://doi.org/10.1007/978-3-319-29175-8_6
2016-01-01
Abstract: Chinese chronic disease entity extraction aims to extract health-related entities from online questions and answers (QA). Our research tackles challenges in Chinese chronic disease entity extraction from three aspects: Chinese health lexicon construction, feature development, and equivalence conjunction tagging. We construct large-scale Chinese health lexicons based on expert knowledge and Web resources; develop a feature extraction approach that draws character, part-of-speech, and lexical features from QA data; and improve the performance of answer entity extraction by leveraging equivalence conjunctions (punctuation marks and conjunctional words in Chinese) to capture dependencies between the tags of entities. Experiments on question and answer entity extraction demonstrate that Precision, Recall, and F-1 score improve with our proposed features, and that Precision and F-1 score improve further when equivalence conjunctions are considered.
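To make the feature description concrete, the following is a minimal sketch of per-character feature extraction that combines character, part-of-speech, and lexical (lexicon-match) features, plus a flag for equivalence conjunctions. It is an illustration under stated assumptions, not the authors' implementation: the lexicon entries, the conjunction set, the POS tags, and the names `DISEASE_LEXICON`, `EQUIV_CONJUNCTIONS`, `lexicon_flags`, and `char_features` are all hypothetical, and the abstract does not specify the tagging model these features would feed.

```python
# Hypothetical resources: the paper's actual lexicons and conjunction list
# are not given in the abstract.
DISEASE_LEXICON = {"糖尿病", "高血压"}
EQUIV_CONJUNCTIONS = {"、", "和", "及", "与", "或"}


def lexicon_flags(text, lexicon):
    """Mark each character covered by any lexicon entry (simple substring scan)."""
    flags = [False] * len(text)
    for entry in lexicon:
        start = text.find(entry)
        while start != -1:
            for i in range(start, start + len(entry)):
                flags[i] = True
            start = text.find(entry, start + 1)
    return flags


def char_features(words_with_pos, lexicon=DISEASE_LEXICON):
    """Build one feature dict per character from (word, POS tag) pairs.

    Assumes the sentence has already been segmented and POS-tagged upstream.
    """
    text = "".join(word for word, _ in words_with_pos)
    in_lex = lexicon_flags(text, lexicon)
    features = []
    idx = 0
    for word, pos in words_with_pos:
        for ch in word:
            features.append({
                "char": ch,                                  # character feature
                "pos": pos,                                  # POS of the containing word
                "in_lexicon": in_lex[idx],                   # lexical feature
                "is_equiv_conj": ch in EQUIV_CONJUNCTIONS,   # equivalence conjunction flag
            })
            idx += 1
    return features


# Example: "diabetes and hypertension", pre-segmented with assumed POS tags.
sentence = [("糖尿病", "n"), ("和", "c"), ("高血压", "n")]
feats = char_features(sentence)
```

In this sketch the conjunction flag marks the character joining two entity mentions, which is one simple way a sequence tagger could learn that entities on either side of such a conjunction tend to share a tag.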