An Algorithm of Chinese Domain Term Extraction Based on Language Feature

Ji-bin FU,Xiao-zhong FAN,Jin-tao MAO,Zheng-tao YU
DOI: https://doi.org/10.15918/j.tbit1001-0645.2010.03.020
2010-01-01
Abstract:An algorithm for Chinese domain term extraction based on language feature is proposed.Domain terms in Chinese have three features: domain cohesiveness,domain relevancy and domain consensus.The algorithm to extract domain term integrates three statistical models which compute domain cohesiveness,domain relevancy and domain consensus respectively.Experimental results show that the algorithm has higher precision and recall than the method based on mutual information and log-likelihood.An automatic evaluation method based on perplexity attenuation ratio is proposed,and the above algorithms are measured by the automatic evaluation method.
What problem does this paper attempt to address?