Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

Yinghui Li,Shang Qin,Haojing Huang,Yangning Li,Libo Qin,Xuming Hu,Wenhao Jiang,Hai-Tao Zheng,Philip S. Yu
2024-09-19
Abstract:Recently, Large Language Models (LLMs) have been widely studied by researchers for their roles in various downstream NLP tasks. As a fundamental task in the NLP field, Chinese Grammatical Error Correction (CGEC) aims to correct all potential grammatical errors in the input sentences. Previous studies have shown that LLMs' performance as correctors on CGEC remains unsatisfactory due to its challenging task focus. To promote the CGEC field to better adapt to the era of LLMs, we rethink the roles of LLMs in the CGEC task so that they can be better utilized and explored in CGEC. Considering the rich grammatical knowledge stored in LLMs and their powerful semantic understanding capabilities, we utilize LLMs as explainers to provide explanation information for the CGEC small models during error correction to enhance performance. We also use LLMs as evaluators to bring more reasonable CGEC evaluations, thus alleviating the troubles caused by the subjectivity of the CGEC task. In particular, our work is also an active exploration of how LLMs and small models better collaborate in downstream tasks. Extensive experiments and detailed analyses on widely used datasets verify the effectiveness of our thinking intuition and the proposed methods.
Computation and Language
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper mainly explores how to make better use of large language models (LLMs) in the Chinese Grammatical Error Correction (CGEC) task. Specifically, the paper attempts to solve the following key problems: 1. **Poor performance of LLMs in the CGEC task**: - Although LLMs perform well in many natural language processing (NLP) tasks, in the CGEC task, even after supervised fine - tuning, the performance of LLMs is still not satisfactory. The main reason is that the free - generation paradigm of LLMs often makes the generated sentences unable to meet the minimum - change principle required by the CGEC task. 2. **Subjectivity and interpretability of the CGEC task**: - The CGEC task is highly subjective. A grammatically incorrect sentence may have multiple correct correction methods. Therefore, how to design an evaluation framework that can comprehensively and flexibly evaluate these correction results is a challenge. - Meanwhile, the explanation information of the incorrect sentences is crucial for improving the performance of CGEC. High - quality explanation information can provide guidance and help the model better understand the error types and correction methods. 3. **Cooperation between LLMs and small models**: - How to make better use of the advantages of LLMs in downstream tasks while giving play to the high efficiency and adaptability of small models is a problem worthy of exploration. ### Solutions To solve the above problems, the paper proposes two main frameworks: 1. **EXplanation - AugMented training framework (EXAM)**: - **Objective**: Use LLMs as interpreters to provide information such as error types, reference corrections, and error explanations, enhance the training of small models, and thus improve the performance of CGEC. - **Implementation**: Input incorrect sentences into LLMs, obtain the explanation information provided by them, and inject this information into the training process of small models. 2. **SEmantic - incorporated Evaluation framework (SEE)**: - **Objective**: Use LLMs as evaluators, consider complex semantics, and conduct more comprehensive and flexible evaluations of CGEC results. - **Implementation**: By comparing the texts of the incorrect sentences and the predicted sentences, evaluate the effectiveness of the predicted edits from three dimensions (correct edits, incorrect edits, reasonable edits), and thus calculate the precision, recall rate, and F0.5 score. ### Main contributions 1. **Propose SEE**: Through the intervention of LLMs, make the evaluation more adaptable to the subjectivity of the CGEC task. 2. **Propose EXAM**: Use LLMs as interpreters to enhance the training of small models, making them not only outperform LLMs in traditional indicators, but also show competitiveness under the SEE framework. 3. **Relocate the role of LLMs**: Give full play to the advantages of LLMs and promote their adaptability in the CGEC task. 4. **Explore the synergetic cooperation between LLMs and small models**: Reveal how LLMs and small models co - exist and develop together in the era of LLMs and contribute to the progress of downstream tasks. Through these methods, the paper aims to promote the further development of the CGEC field, especially in the context of the era of LLMs.