Chinese WSD Based on Selecting the Best Seeds from Collocations

Liu Hui
Abstract:The key problem of word sense disambiguation based on statistic model lies in how to acquiring the word sense indicators automatically. Although it is feasible to acquire a large number of collocations by learning examples, it is hard to select good seeds manually to increase new collocations effectively. The method of selecting the best seeds by machine learning is provided in this paper to solve this problem. The best seeds are used to augment more new word sense indicators; finally disambiguate polysemous words with the acquired indicators. The average accuracy is 87.7% for 8 polysemous words by this method.
Computer Science
What problem does this paper attempt to address?