One Sense Per N-gram

Pengyuan Liu, Shui Liu, Shiqi Li, Shiwen Yu
DOI: https://doi.org/10.1109/wi-iat.2010.268
2010-01-01
Abstract: This paper presents a novel supposition, One Sense Per N-gram (N1), which we believe applies to more linguistic phenomena than the celebrated One Sense Per Collocation supposition and can serve as a more general version of it, at least in the Chinese language. The new supposition is based on our observations while detecting errors in the annotated senses of People’s Daily text that was tagged by an automatic WSD system. Our preliminary experiment on the Chinese Word Sense Tagging Corpus shows that it holds with over 85.9% agreement for both nouns and verbs. Based on the supposition, we built a prototype naïve Bayes WSD system and tested it on the Multilingual Chinese-English Lexical Sample task (MCELS) in Semeval-2007. Experimental results show that our prototype system improves the performance of the baseline system by 2.7%.
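To make the supposition concrete, the following is a minimal sketch of one way to measure how often an n-gram containing a target word carries a single sense. It is not the authors' procedure: the abstract does not define how agreement is computed, so the majority-sense measure, the `min_count` threshold, the function names, and the English toy data are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def ngrams_with_target(tokens, idx, n):
    """Return (offset, n-gram) pairs for every n-gram covering position idx.

    offset is the position of the target word inside the n-gram, so the
    same word sequence with a different target position is kept apart.
    """
    grams = []
    for start in range(idx - n + 1, idx + 1):
        if start >= 0 and start + n <= len(tokens):
            grams.append((idx - start, tuple(tokens[start:start + n])))
    return grams


def ngram_sense_agreement(instances, n=2, min_count=2):
    """Share of n-gram occurrences that carry their n-gram's majority sense.

    instances: iterable of (tokens, target_index, sense_label).
    N-grams seen fewer than min_count times are ignored (an assumption),
    because a single occurrence agrees with itself trivially.
    """
    senses_by_gram = defaultdict(Counter)
    for tokens, idx, sense in instances:
        for key in ngrams_with_target(tokens, idx, n):
            senses_by_gram[key][sense] += 1

    agree = total = 0
    for counts in senses_by_gram.values():
        occurrences = sum(counts.values())
        if occurrences < min_count:
            continue
        agree += max(counts.values())  # occurrences with the majority sense
        total += occurrences
    return agree / total if total else None


# Invented toy data: (tokens, index of the target word, sense label).
toy_instances = [
    (["river", "bank", "erosion"], 1, "shore"),
    (["river", "bank", "flooded"], 1, "shore"),
    (["bank", "account", "closed"], 0, "finance"),
    (["bank", "account", "opened"], 0, "finance"),
    (["the", "bank", "account"], 1, "finance"),
    (["the", "bank", "flooded"], 1, "shore"),
]

agreement = ngram_sense_agreement(toy_instances, n=2)
print(f"bigram sense agreement: {agreement:.3f}")
```

In this toy data the bigram "the bank" appears with two different senses, so it lowers the score, while "river bank" and "bank account" each keep a single sense. A real measurement would run over a sense-tagged corpus and would need word segmentation for Chinese text.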