Abbreviation Prediction Using Conditional Random Field and Web Data

JIAO Yan,WANG Houfeng,ZHANG Longkai
DOI: https://doi.org/10.3969/j.issn.1003-0077.2012.02.012
2012-01-01
Abstract:Abbreviations are commonly used in natural languages and constitutes a substantial proportion of Unknown Words,which challenges Natural Language Processing.This article proposes a strategy of predicting abbreviation from full form in Chinese.For a full form,it firstly generates a number of candidates using Conditional Random Field.Then each of the candidates is re-scored according to the results from Web Search Engine based on different search conditions and statistic methods.The candidate with highest score is selected as the abbreviation.Experiments show the precision improves about 5% compared with single Conditional Random Field method.
What problem does this paper attempt to address?