A Hybrid Model for Chinese Confusable Words Distinguishing in Proofreading

Luozheng Li,Peipei Song,Dan Zhang,Dongyan Zhao
DOI: https://doi.org/10.1007/978-3-031-06703-7_36
2022-01-01
Abstract:Distinguishing the confusable words is an essential task for Chinese teaching and publications proofreading. Existing studies have made progress in analyzing confusable words in the lexical semantic view. However, few effective automated methods are applied to solve this problem. This paper proposes a hybrid model to distinguish the confusable words in proofreading. Through ensemble learning, the model combines contextual and non-contextual features of the input sentence. Experimental results on two test sets demonstrate that the hybrid model achieves superior performance against other baseline methods including solely BERT.
What problem does this paper attempt to address?