An Adversarial Multi-Task Learning Method for Chinese Text Correction with Semantic Detection

Fanyu Wang, Zhenping Xie

DOI: https://doi.org/10.48550/arXiv.2306.16313

2023-06-28

Computation and Language

Abstract: Text correction, especially semantic correction in widely used scenarios, is in strong demand for improving the fluency and writing efficiency of text. An adversarial multi-task learning method is proposed to enhance the modeling and detection of character polysemy in Chinese sentence context. Two models, a masked language model and a scoring language model, are introduced as a pair of learning tasks that are both coupled and adversarial. Moreover, a Monte Carlo tree search strategy and a policy network are introduced to accomplish Chinese text correction with semantic detection efficiently. Experiments are conducted on three datasets against five comparable methods, and the results show that the method performs well on the Chinese text correction task and yields better semantic rationality.

What problem does this paper attempt to address?

The paper addresses complex semantic errors in Chinese text correction. The authors propose a new adversarial multi-task learning method that handles character-level and phrase-level correction in a unified way. The method improves the model's ability to model and detect the polysemy of Chinese characters by training a masked language model and a scoring language model as a pair of adversarial learning tasks. To make correction efficient, the paper also introduces a Monte Carlo Tree Search (MCTS) strategy and a policy network, which improve the computational efficiency and accuracy of the search for error locations. Experimental results show that the method outperforms existing methods on three datasets and has significant advantages in correction scenarios of different lengths. Overall, the main contribution is a new framework that can effectively handle complex semantic errors in Chinese text.
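To make the idea of a policy-guided search for error locations more concrete, here is a minimal sketch. It is not the authors' implementation. It assumes a PUCT-style selection rule (as popularized by AlphaZero-like systems), and it flattens the tree to a single level, so each candidate character position is treated as one arm. The names `puct_select`, `search_error_position`, `priors`, `reward_fn`, and `c_puct` are hypothetical: `priors` stands in for the policy network's output, and `reward_fn` stands in for feedback from a scoring language model.

```python
import math


def puct_select(priors, visit_counts, total_values, c_puct=1.0):
    """Return the index of the candidate position with the highest PUCT score.

    Assumed rule (not taken from the paper):
        score = Q + c_puct * P * sqrt(total_visits + 1) / (1 + N)
    """
    total_visits = sum(visit_counts)
    best_index, best_score = 0, float("-inf")
    for i, (p, n, w) in enumerate(zip(priors, visit_counts, total_values)):
        q = w / n if n > 0 else 0.0  # mean reward observed so far
        u = c_puct * p * math.sqrt(total_visits + 1) / (1 + n)  # exploration bonus
        score = q + u
        if score > best_score:
            best_index, best_score = i, score
    return best_index


def search_error_position(priors, reward_fn, n_simulations=50, c_puct=1.0):
    """Single-level search: repeatedly pick a position, observe a reward,
    and return the most-visited position as the suspected error location."""
    n = len(priors)
    visits = [0] * n
    values = [0.0] * n
    for _ in range(n_simulations):
        i = puct_select(priors, visits, values, c_puct)
        values[i] += reward_fn(i)  # hypothetical stand-in for a scoring model
        visits[i] += 1
    return max(range(n), key=lambda i: visits[i])


if __name__ == "__main__":
    # Toy setup: the prior wrongly favours position 0, but only position 1
    # is rewarded, so the search should shift its visits to position 1.
    suspected = search_error_position(
        priors=[0.7, 0.3],
        reward_fn=lambda i: 1.0 if i == 1 else 0.0,
    )
    print(suspected)
```

In this toy setting the prior only biases early exploration. Positions that keep earning reward accumulate visits, which is the general reason for pairing a policy prior with search rather than trusting the prior alone.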

Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition

Combining error oversampling and multi-task learning for Chinese meteorological alert text correction

An Alignment-Agnostic Model for Chinese Text Error Correction

Adversarial Multi-Task Learning for Efficient Chinese Named Entity Recognition

On the (In)Effectiveness of Large Language Models for Chinese Text Correction

Multi-Task Fine-Tuning on BERT Using Spelling Errors Correction for Chinese Text Classification Robustness

Improving Pre-trained Language Models with Syntactic Dependency Prediction Task for Chinese Semantic Error Recognition

Winnow-based approach in automatic error detection and correction of Chinese text

Research on Chinese Text Error Correction Based on Sequence Model

MCSSpell: Optimal Path Selection of Candidate Characters by Integrating Multimodal Information and Copy Mechanism for Chinese Spelling Correction

An Error-Guided Correction Model for Chinese Spelling Error Correction

A Tree-Structure Analysis Network on Handwritten Chinese Character Error Correction

Multi-task Learning for Chinese Word Usage Errors Detection

Short text matching model with multiway semantic interaction based on multi-granularity semantic embedding

Adversarial Multi-Criteria Learning for Chinese Word Segmentation

Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models

PSDSpell: Pre-Training with Self-Distillation Learning for Chinese Spelling Correction

Spelling Error Correction with Soft-Masked BERT

WordChange: Adversarial Examples Generation Approach for Chinese Text Classification