The Science of Detecting LLM-Generated Texts

Ruixiang Tang,Yu-Neng Chuang,Xia Hu
2023-06-03
Abstract:The emergence of large language models (LLMs) has resulted in the production of LLM-generated texts that is highly sophisticated and almost indistinguishable from texts written by humans. However, this has also sparked concerns about the potential misuse of such texts, such as spreading misinformation and causing disruptions in the education system. Although many detection approaches have been proposed, a comprehensive understanding of the achievements and challenges is still lacking. This survey aims to provide an overview of existing LLM-generated text detection techniques and enhance the control and regulation of language generation models. Furthermore, we emphasize crucial considerations for future research, including the development of comprehensive evaluation metrics and the threat posed by open-source LLMs, to drive progress in the area of LLM-generated text detection.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to explore and address the issue of detecting text generated by large language models (LLMs). With the advancement of LLMs technology, models such as OpenAI's ChatGPT can generate highly realistic human-like text, raising concerns about the potential misuse of these technologies, such as spreading misinformation, educational cheating, etc. Although various detection methods have been proposed, a systematic understanding and evaluation are still lacking. This paper provides an overview of existing LLM-generated text detection techniques and highlights key considerations for future research, including the development of comprehensive evaluation metrics and the threats posed by open-source LLMs, to advance progress in this field. Specifically, the paper first introduces black-box detection methods, which rely solely on API-level access to LLMs, distinguishing between human and machine-generated text samples by training classifiers. However, as LLMs continue to improve, the effectiveness of black-box methods gradually declines. Additionally, white-box detection methods are discussed, where detectors have full access to LLMs, allowing control over their generation behavior to achieve traceability. The paper further analyzes the applications and potential impacts of LLMs in different fields, such as healthcare, law, journalism, and cybersecurity, emphasizing the importance of developing effective detection tools. Finally, the paper proposes future research directions, including enhancing the performance of detection systems, improving interpretability, and addressing emerging challenges.