DBCERT: Reconstruct the BERT Model for Chinese Spelling Correction

Rongpeng Li,Xiaohong Xiang,Bo Hu,Xin Deng,Jun Zhao
DOI: https://doi.org/10.1109/ispce-asia64773.2024.10756270
2024-01-01
Abstract:Chinese spelling correction (CSC) constitutes a research field of high practical significance. However, most studies proposed a large number of auxiliary modules to fulfill the design goals, resulting in increased complexity in the model structure and the number of parameters. In this paper, we propose a model named DBCERT, which constructs detection and correction modules by reconstructing BERT, endowing it with strong capabilities to perceive and correct spelling errors. Additionally, similarity samples are proposed to train the model's ability to perceive the similar relationship between characters. Without adding any auxiliary modules, DBCERT achieves excellent Chinese spelling correction performance, which represents a significant improvement compared to other works.
What problem does this paper attempt to address?