Research on Corpus Creation and Development of Chinese Traditional Medicine

刘耀,段慧明,王惠临,周扬,王振国,李宏展
DOI: https://doi.org/10.3969/j.issn.1003-0077.2008.04.004
2008-01-01
Abstract: A domain corpus is essential to natural language processing of domain documents, especially for analyzing their content and intent. Drawing on its specific research background, this paper first explains the necessity and significance of natural language processing for domain documents. After analyzing the characteristics of domain corpora, it examines the design strategy and principles of domain corpus construction, and it also investigates part-of-speech tagging in the corpus. Finally, a human-aided processing system for domain corpora is developed, providing theoretical guidance and technical support for domain corpus construction.