Abstract:Recently, Large Language Models (LLMs) have been widely studied by researchers for their roles in various downstream NLP tasks. As a fundamental task in the NLP field, Chinese Grammatical Error Correction (CGEC) aims to correct all potential grammatical errors in the input sentences. Previous studies have shown that LLMs' performance as correctors on CGEC remains unsatisfactory due to its challenging task focus. To promote the CGEC field to better adapt to the era of LLMs, we rethink the roles of LLMs in the CGEC task so that they can be better utilized and explored in CGEC. Considering the rich grammatical knowledge stored in LLMs and their powerful semantic understanding capabilities, we utilize LLMs as explainers to provide explanation information for the CGEC small models during error correction to enhance performance. We also use LLMs as evaluators to bring more reasonable CGEC evaluations, thus alleviating the troubles caused by the subjectivity of the CGEC task. In particular, our work is also an active exploration of how LLMs and small models better collaborate in downstream tasks. Extensive experiments and detailed analyses on widely used datasets verify the effectiveness of our thinking intuition and the proposed methods.

What problem does this paper attempt to address?

### Problems the paper attempts to solve This paper mainly explores how to make better use of large language models (LLMs) in the Chinese Grammatical Error Correction (CGEC) task. Specifically, the paper attempts to solve the following key problems: 1. **Poor performance of LLMs in the CGEC task**: - Although LLMs perform well in many natural language processing (NLP) tasks, in the CGEC task, even after supervised fine - tuning, the performance of LLMs is still not satisfactory. The main reason is that the free - generation paradigm of LLMs often makes the generated sentences unable to meet the minimum - change principle required by the CGEC task. 2. **Subjectivity and interpretability of the CGEC task**: - The CGEC task is highly subjective. A grammatically incorrect sentence may have multiple correct correction methods. Therefore, how to design an evaluation framework that can comprehensively and flexibly evaluate these correction results is a challenge. - Meanwhile, the explanation information of the incorrect sentences is crucial for improving the performance of CGEC. High - quality explanation information can provide guidance and help the model better understand the error types and correction methods. 3. **Cooperation between LLMs and small models**: - How to make better use of the advantages of LLMs in downstream tasks while giving play to the high efficiency and adaptability of small models is a problem worthy of exploration. ### Solutions To solve the above problems, the paper proposes two main frameworks: 1. **EXplanation - AugMented training framework (EXAM)**: - **Objective**: Use LLMs as interpreters to provide information such as error types, reference corrections, and error explanations, enhance the training of small models, and thus improve the performance of CGEC. - **Implementation**: Input incorrect sentences into LLMs, obtain the explanation information provided by them, and inject this information into the training process of small models. 2. **SEmantic - incorporated Evaluation framework (SEE)**: - **Objective**: Use LLMs as evaluators, consider complex semantics, and conduct more comprehensive and flexible evaluations of CGEC results. - **Implementation**: By comparing the texts of the incorrect sentences and the predicted sentences, evaluate the effectiveness of the predicted edits from three dimensions (correct edits, incorrect edits, reasonable edits), and thus calculate the precision, recall rate, and F0.5 score. ### Main contributions 1. **Propose SEE**: Through the intervention of LLMs, make the evaluation more adaptable to the subjectivity of the CGEC task. 2. **Propose EXAM**: Use LLMs as interpreters to enhance the training of small models, making them not only outperform LLMs in traditional indicators, but also show competitiveness under the SEE framework. 3. **Relocate the role of LLMs**: Give full play to the advantages of LLMs and promote their adaptability in the CGEC task. 4. **Explore the synergetic cooperation between LLMs and small models**: Reveal how LLMs and small models co - exist and develop together in the era of LLMs and contribute to the progress of downstream tasks. Through these methods, the paper aims to promote the further development of the CGEC field, especially in the context of the era of LLMs.

Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

On the (In)Effectiveness of Large Language Models for Chinese Text Correction

Evaluating LLMs' grammatical error correction performance in learner Chinese

Evaluating the Capability of Large-scale Language Models on Chinese Grammatical Error Correction Task

A Chinese Grammatical Error Correction Model Based On Grammatical Generalization And Parameter Sharing

LM-Combiner: A Contextual Rewriting Model for Chinese Grammatical Error Correction

Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning

Large Language Models Are State-of-the-Art Evaluator for Grammatical Error Correction

Automatically Correcting Large Language Models: Surveying the Landscape of Diverse Automated Correction Strategies

Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction

EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction

Pillars of Grammatical Error Correction: Comprehensive Inspection Of Contemporary Approaches In The Era of Large Language Models

LM-Critic: Language Models for Unsupervised Grammatical Error Correction

Detection-Correction Structure via General Language Model for Grammatical Error Correction

Large Language Models Cannot Self-Correct Reasoning Yet

Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models

From Spelling to Grammar: A New Framework for Chinese Grammatical Error Correction

Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector

Prompting open-source and commercial language models for grammatical error correction of English learner text

Harnessing Large Language Models as Post-hoc Correctors