Abstract:The selection of features is critical in providing discriminative information for classifiers in Word Sense Disambiguation (WSD). Uninformative features will degrade the performance of classifiers. Based on the strong evidence that an ambiguous word expresses a unique sense in a given collocation, this paper reports our experiments on automatic WSD using collocation as local features based on the corpus extracted from People’s Daily News (PDN) as well as the standard SENSEVAL-3 data set. Using the Naive Bayes classifier as our core algorithm, we have implemented a classifier using a feature set combining both local collocation features and topical features. The average precision on the PDN corpus has 3.2% improvement compared to 81.5% of the baseline system where collocation features are not considered. For the SENSEVAL-3 data, we have reached the precision rate of 37.6% by integrating collocation features into contextual features, to achieve 37% improvement over 26.7% of precision in the baseline system. Our experiments have shown that collocation features can be used to reduce the size of human tagged corpus.

Optimizing Feature Set for Chinese Word Sense Disambiguation.

Unsupervised Word Sense Disambiguation Based on WordNet

Coarse-Grained Word Sense Disambiguation Using Features Described in the Lexicon

Word Sense Disambiguation Based on Improved Bayesian Classifiers

Chinese Verb Sense Disambiguation Using AdaBoosting

HIT-IR-WSD: A WSD System for English Lexical Sample Task.

Integrating Collocation Features in Chinese Word Sense Disambiguation.

Chinese Word Sense Disambiguation Using a LSTM

Two statistics methods of Chinese word sense disambiguation

A Study in Dictionary-Based All-word Word Sense Disambiguation for Pre-Qin Chinese

Chinese WSD Based on Selecting the Best Seeds from Collocations

Word Sense Indicators: Effective Feature For Chinese Word Sense Disambiguation

Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation

Improving Topic Extraction in Chinese Documents Using Word Sense Disambiguation

Word Sense Disambiguation Method with Topic Feature

Multi-strategy approach to Chinese word sense disambiguation based on sememe relations

A Unified Model for Word Sense Representation and Disambiguation.

Chinese Word Sense Disambiguation Based on Context Expansion.

A survey of Chinese word sense disambiguation:Resources,methods and evaluation

Using BERT for Word Sense Disambiguation

Word Sense Disambiguation Based on Positional Weighted Context