An LLM-Enhanced Adversarial Editing System for Lexical Simplification

Keren Tan,Kangyang Luo,Yunshi Lan,Zheng Yuan,Jinlong Shu
2024-02-23
Abstract:Lexical Simplification (LS) aims to simplify text at the lexical level. Existing methods rely heavily on annotated data, making it challenging to apply in low-resource scenarios. In this paper, we propose a novel LS method without parallel corpora. This method employs an Adversarial Editing System with guidance from a confusion loss and an invariance loss to predict lexical edits in the original sentences. Meanwhile, we introduce an innovative LLM-enhanced loss to enable the distillation of knowledge from Large Language Models (LLMs) into a small-size LS system. From that, complex words within sentences are masked and a Difficulty-aware Filling module is crafted to replace masked positions with simpler words. At last, extensive experimental results and analyses on three benchmark LS datasets demonstrate the effectiveness of our proposed method.
Computation and Language
What problem does this paper attempt to address?
The paper aims to address the issues in the task of Lexical Simplification (LS), particularly in low-resource scenarios where parallel corpora are lacking. Existing methods heavily rely on annotated data, which is very challenging to apply in low-resource scenarios. Therefore, the paper proposes a new LS method that can simplify the vocabulary in original sentences without the need for parallel corpora. Specifically, the paper proposes an enhanced adversarial editing system (LAE-LS), which includes two modules: 1. **Adversarial Editing Module**: Predicts lexical edits through confusion loss and invariance loss, and introduces large language models (LLMs) enhanced loss to extract supervision signals from large language models. 2. **Difficulty-Aware Filling Module**: Replaces masked complex vocabulary to ensure that the replaced vocabulary is simpler. Experimental results show that this method achieves significant results on three benchmark LS datasets and performs well even with a parameter size much smaller than GPT-3.5-turbo.