Ensemble of RFR_SUM unigram and bigram for Chinese WSD

Weiguang Qu, Jingsong Yu, Junsheng Zhou, Yanqiu Shao, Sujian Li, Zhifang Sui
2007-01-01
Journal of Computational Information Systems
Abstract: In this paper, we extend the collocation-based WSD model RFR_SUM (sum of Relative Frequency Ratios in context) from unigrams (UNI_RFR_SUM) to bigrams (BI_RFR_SUM) and design two algorithms for BI_RFR_SUM: the Simple BI_RFR_SUM algorithm (SBI) and the No Intersection BI_RFR_SUM algorithm (NI). We select 7 frequently used polysemous words as examples, and the experiments show that the precision of the NI algorithm can be adjusted to a very high level. Combining UNI_RFR_SUM with the NI algorithm yields a precision of 96.40% in the open test, compared with 93.23% for UNI_RFR_SUM and 93.32% for SBI. That is, ensemble learning eliminates 46.82% of the misclassifications of the UNI_RFR_SUM model.
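The 46.82% figure can be checked against the two reported precisions. The sketch below is only an arithmetic check, not part of the authors' method; it assumes that every test instance receives a sense label, so that the error rate is simply 100% minus the precision. The function name `relative_error_reduction` is hypothetical.

```python
def relative_error_reduction(baseline_precision, ensemble_precision):
    """Fraction of the baseline's errors removed by the ensemble.

    Both arguments are precisions in percent. Assumption: error rate is
    100 - precision, i.e. every instance is assigned a sense.
    """
    baseline_error = 100.0 - baseline_precision
    ensemble_error = 100.0 - ensemble_precision
    return (baseline_error - ensemble_error) / baseline_error


# Precisions reported in the abstract: UNI_RFR_SUM alone vs. the ensemble.
reduction = relative_error_reduction(93.23, 96.40)
print(f"{reduction:.2%}")  # prints 46.82%
```

Under that assumption, the error rate falls from 6.77% to 3.60%, and 3.17 / 6.77 is about 0.4682, which matches the reduction stated in the abstract.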