Chinese Word Sense Disambiguation Based on Latent Maximum Entropy Principle

ZHANG Yangsen,HUANG Gaijuan,SU Wenjie
DOI: https://doi.org/10.3969/j.issn.1003-0077.2012.03.013
2012-01-01
Abstract:We present a new approach to Chinese word sense disambiguation based on latent maximum entropy principle(LME),which is different from Jaynes' maximum entropy principle that only use the context statistical characteristics to construct language model.After studying the relationship between the word and the sememe in Hownet,we convert the word collocation that obtained from the context of training corpus into the sememe collocation,and realize the extraction of text latent semantic features based on sememe collocations.Combined with the traditional context features,the latent maximum entropy principle is applied to disambiguate polysemy words.Experimental results show that the method proposed improves the accuracy by about 4% in the sense disambiguation of 10 polysemous verbs word.
What problem does this paper attempt to address?