What problem does this paper attempt to address?

The problem that this paper attempts to solve is to embed watermarks in the text generated by large - language models (LLMs) to distinguish whether the text is machine - generated or human - generated, while minimizing the impact on the quality of the generated text under the premise of maintaining the identifiability of the watermark. Specifically, the paper focuses on the trade - off between the **identifiability** and **stealthiness** of the watermark. The author proposes a systematic method to transform this trade - off problem into a multi - objective optimization problem and identify the Pareto optimal solutions associated with a large class of robust and efficient watermarking schemes. Through this method, the paper shows how to improve the identifiability of the watermark without significantly reducing the text quality, thus outperforming the existing default watermarking schemes. ### Key Points Summary: 1. **Problem Background**: With the wide application of large - language models, concerns about their potential misuse are increasing, such as plagiarism, Internet propaganda, cheating in exams, false information dissemination, and copyright infringement. To solve these problems, a possible strategy is to ensure that the text generated by LLM can be distinguished from human - generated text by algorithms, that is, by embedding watermarks. 2. **Watermarking Mechanism**: The paper adopts a watermarking mechanism based on the green - red division of the vocabulary. Before generating each word, the complete vocabulary of the LLM is divided into two mutually exclusive lists, marked as green and red. This division is pseudo - random, and the seed is determined by the previous word. Words in the green list are sampled with a higher probability, while words in the red list are sampled with a lower probability. The detector determines whether the text is generated by the LLM according to the number of words in the green list in the text. 3. **Optimization Objectives**: The paper formalizes the trade - off between the identifiability and stealthiness of the watermark as a multi - objective optimization problem. Specifically, the optimization objective is to maximize the test quality (i.e., the ability to correctly identify the generator) while minimizing the degradation of text quality. 4. **Methods and Contributions**: The author proposes an optimization framework to optimize the above multi - objective problem by adjusting the selection probability of words in the green list. They identify Pareto optimal solutions and verify the effectiveness of these solutions through experiments. The results show that the proposed optimized watermarking scheme outperforms the existing default watermarking schemes in terms of the test - text trade - off. 5. **Experimental Results**: Experiments show that the optimized watermarking scheme not only performs better in identifiability but also performs well in maintaining text quality. In particular, under different test conditions, the optimized watermarking scheme can achieve a high test power while maintaining a low expected log - perplexity. ### Conclusion: By systematically analyzing and optimizing the trade - off between the identifiability and stealthiness of the watermark, the paper provides an effective method to enhance the security and credibility of the text generated by LLM while minimizing the impact on text quality. This provides important references and guidance for future research on embedding watermarks in the text generated by LLM.

Optimizing watermarks for large language models

Unbiased Watermark for Large Language Models

A Watermark for Large Language Models

WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models

Three Bricks to Consolidate Watermarks for Large Language Models

Adaptive Text Watermark for Large Language Models

Necessary and Sufficient Watermark for Large Language Models

WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models

Provably Robust Watermarks for Open-Source Language Models

A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules

Baselines for Identifying Watermarked Large Language Models

Mark My Words: Analyzing and Evaluating Language Model Watermarks

Advancing Beyond Identification: Multi-bit Watermark for Large Language Models

Cross-Attention Watermarking of Large Language Models

Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models

REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models

Watermarking Large Language Models and the Generated Content: Opportunities and Challenges

Improving the Generation Quality of Watermarked Large Language Models via Word Importance Scoring

Universally Optimal Watermarking Schemes for LLMs: from Theory to Practice

A Semantic Invariant Robust Watermark for Large Language Models

On the Reliability of Watermarks for Large Language Models