Combining error oversampling and multi-task learning for Chinese meteorological alert text correction

Kuoyin Wang, Y. Mei, Hanhua Qu, Jianzhong Hui, Muhua Wang, Wei Tang
DOI: https://doi.org/10.1117/12.2639645
2022-05-19
Abstract: Chinese meteorological alert texts issued by meteorological authorities must be free of spelling errors. Automatic spelling error correction can detect errors in text and suggest corrections. Most existing studies focus on open-domain text correction, such as news; methods for vertical-domain text correction, such as meteorological alert text, have not been well studied. In this work, we exploit the templated nature of meteorological alert text and propose an error oversampling strategy to enhance the training of the correction model. For the correction model itself, we use multi-task learning, training it on error detection and error correction simultaneously. Experimental results on real-world alert texts show that our proposed method significantly outperforms the baseline, exceeding it by 3% on the F1 measure.
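The two ideas named in the abstract can be illustrated with a minimal sketch. The abstract does not say how the oversampling or the task weighting is actually done, so everything here is an assumption for illustration: the function names, the `factor` and `weight` parameters, the simple duplication scheme, and the example sentences are all hypothetical and are not taken from the paper.

```python
import random


def oversample_errors(pairs, factor=3, seed=0):
    """Repeat training pairs that contain an error (assumed scheme).

    pairs: list of (source, target) strings of equal length, where
    source may contain spelling errors and target is the correct text.
    A pair with source != target is kept `factor` times in total.
    """
    rng = random.Random(seed)
    out = []
    for src, tgt in pairs:
        copies = factor if src != tgt else 1
        out.extend([(src, tgt)] * copies)
    rng.shuffle(out)
    return out


def detection_labels(src, tgt):
    """Per-character detection targets: 1 where the character is wrong."""
    return [int(s != t) for s, t in zip(src, tgt)]


def multitask_loss(detection_loss, correction_loss, weight=0.5):
    """Weighted sum of the two task losses (hypothetical weighting)."""
    return weight * detection_loss + (1 - weight) * correction_loss


# One clean alert sentence and one with a single wrong character.
pairs = [
    ("暴雨蓝色预警", "暴雨蓝色预警"),
    ("暴雨兰色预警", "暴雨蓝色预警"),
]
train = oversample_errors(pairs, factor=3)
```

In this sketch the detection labels are derived from the same aligned pairs used for correction, so a single model could be trained on both objectives from one dataset; whether the paper does this is not stated in the abstract.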
Environmental Science, Engineering, Computer Science