Identifying Chinese Place Names Based on Support Vector Machines and Rules

LI Li-shuang,HUANG De-gen,CHEN Chun-rong,YANG Yuan-sheng
DOI: https://doi.org/10.3969/j.issn.1003-0077.2006.05.008
2006-01-01
Abstract:By analyzing the characteristics of place names in Chinese texts,a method of automatic recognition of Chinese place names is presented,which combining support vector machines(SVMs) with rules.Firstly,feature vectors based on characters are extracted,and transferred into binary vectors.A training set is established,and the machine learning models for automatic identification of Chinese place names are obtained using polynomial kernel functions.Then,through careful error analysis,a rulebase is constructed and a post-processing step based on it is used,to overcome the shortcoming of low recall of machine learning model.The results show that the method is efficient for identifying Chinese place names.In open test,the recall,precision and F-measure reach 89.57%,93.52% and 91.50% respectively.
What problem does this paper attempt to address?