Abstract:Over the past year, the field of Natural Language Generation (NLG) has experienced an exponential surge, largely due to the introduction of Large Language Models (LLMs). These models have exhibited the most effective performance in a range of domains within the Natural Language Processing and Generation domains. However, their application in domain-specific tasks, such as paraphrasing, presents significant challenges. The extensive number of parameters makes them difficult to operate on commercial hardware, and they require substantial time for inference, leading to high costs in a production setting. In this study, we tackle these obstacles by employing LLMs to develop three distinct models for the paraphrasing field, applying a method referred to as sequence-level knowledge distillation. These distilled models are capable of maintaining the quality of paraphrases generated by the LLM. They demonstrate faster inference times and the ability to generate diverse paraphrases of comparable quality. A notable characteristic of these models is their ability to exhibit syntactic diversity while also preserving lexical diversity, features previously uncommon due to existing data quality issues in datasets and not typically observed in neural-based approaches. Human evaluation of our models shows that there is only a 4% drop in performance compared to the LLM teacher model used in the distillation process, despite being 1000 times smaller. This research provides a significant contribution to the NLG field, offering a more efficient and cost-effective solution for paraphrasing tasks.

An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase Generation

Exploring Diverse Expressions for Paraphrase Generation

Paraphrase Generation with Collaboration Between the Forward and the Backward Decoder

Imitating Language via Scalable Inverse Reinforcement Learning

Polly Want a Cracker: Analyzing Performance of Parroting on Paraphrase Generation Datasets

A Deep Generative Framework for Paraphrase Generation

Unsupervised Disentanglement Learning Model for Exemplar-Guided Paraphrase Generation

ConRPG: Paraphrase Generation using Contexts as Regularizer

A Semantically Consistent and Syntactically Variational Encoder-Decoder Framework for Paraphrase Generation.

Revisiting Pivot-Based Paraphrase Generation - Language Is Not the Only Optional Pivot.

In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting

Joint Copying and Restricted Generation for Paraphrase.

A Submodular Optimization-Based VAE-Transformer Framework for Paraphrase Generation.

Query and Output: Generating Words by Querying Distributed Word Representations for Paraphrase Generation

Bidirectional Long Short-Term Memory with Gated Relevance Network for Paraphrase Identification

Neural Syntactic Preordering for Controlled Paraphrase Generation

ParaMac: A General Unsupervised Paraphrase Generation Framework Leveraging Semantic Constraints and Diversifying Mechanisms.

Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing

Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation

Learning to Generate Better Than Your LLM

Dialogue Generation: From Imitation Learning to Inverse Reinforcement Learning