Chinese Name Recognition Based on Boundary Templates and Local Frequency

LI Zhong-guo,LIU Ying
DOI: https://doi.org/10.3969/j.issn.1003-0077.2006.05.007
2006-01-01
Abstract:In this paper an effective algorithm for Chinese person name recognition is proposed.Person name's left and right boundary words and person name's character frequency are extracted from tagged corpus,which will be used as the knowledge for recognition.First we use these boundary templates to find possible person names.Then these recognized person names are used to match the missed occurrence in the text.At last,the local frequency obtained from the whole text is used to check and correct the name boundaries.The time complexity of this algorithm is linear,and the test result on 1,354 news articles(with 3.04 million Chinese characters and 37,014 Chinese names in all) gives the precision of 94.52% and the recall of 98.97%,which is fairly satisfying in comparison with other published algorithms.
What problem does this paper attempt to address?