Combining Trigram and Automatic Weight Distribution in Chinese Spelling Error Correction

Li Jianhua,Wang Xiaolong
DOI: https://doi.org/10.1007/bf02960784
2002-01-01
Abstract:The researches on spelling correction aiming at detecting errors in texts tend to focus on context-sensitive spelling error correction, which is more difficult than traditional isolated-word error correction. A novel and efficient algorithm for the system of Chinese spelling error correction, CInsunSpell, is presented. In this system, the work of correction includes two parts: checking phase and correcting phase. At the first phase, a Trigram algorithm within one fixed-size window is designed to located potential errors in local area. The second phase employs a new method of automatically and dynamically distributing weights among the characters in the confusion set as well as in the Bayesian language model. The tactics use above exhibits good performances.
What problem does this paper attempt to address?