Chinese Name Entity Extraction System Based on a Hybrid Model

WANG Rui, ZHANG Jie, ZHANG Youyi, YU Zhen, YAO Tianfang
DOI: https://doi.org/10.3321/j.issn:1000-0054.2005.09.039
2005-01-01
Abstract: After summarizing and analyzing the state of the art in Chinese name entity extraction, we argue that three fundamental problems must be solved: word segmentation, domain, and method. We then propose corresponding solutions: using rules to correct errors in texts after word segmentation; establishing domain-specific rules based on a new Mountain Chain model; and combining statistical and linguistic methods so that different kinds of name entities are treated separately. The experimental results show that word segmentation errors greatly affect the final results, that domain-specific rules help improve extraction, and that a combination of diverse methods performs better than any single method.
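To illustrate the first of these solutions, rule-based correction after word segmentation, here is a minimal Python sketch. It is not the authors' rule set: the surname list, the stop-character list, and the function name `merge_person_names` are hypothetical, chosen only to show how a rule can repair a person name that a segmenter has split into single characters.

```python
# Hypothetical resources for illustration; the paper's actual rules are not given in the abstract.
SURNAMES = {"王", "张", "姚", "余"}        # assumed tiny surname lexicon
STOP_CHARS = {"是", "的", "在", "和"}      # assumed function characters that end a name


def merge_person_names(tokens):
    """Merge a surname token with up to two following single-character tokens."""
    merged = []
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok in SURNAMES:
            j = i + 1
            # Collect at most two single-character tokens as the given name,
            # stopping at any function character.
            while (
                j < len(tokens)
                and j - i <= 2
                and len(tokens[j]) == 1
                and tokens[j] not in STOP_CHARS
            ):
                j += 1
            if j > i + 1:
                merged.append("".join(tokens[i:j]))
                i = j
                continue
        merged.append(tok)
        i += 1
    return merged


if __name__ == "__main__":
    # An over-segmented sentence: the name has been split into three tokens.
    print(merge_person_names(["姚", "天", "昉", "是", "教授"]))
```

A rule like this runs after the segmenter and before entity classification, so a segmentation error does not propagate into the final extraction result. A real system would need far richer lexicons and context checks than this sketch assumes.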