Abstract:Recent advancements in natural language generation (NLG) technology have significantly improved the diversity, control, and quality of large language models (LLM)-generated text. A notable example is OpenAI's ChatGPT, which demonstrates exceptional performance in tasks such as answering questions, composing email messages, essays, and codes. However, this newfound capability to produce human-like text at high efficiency also raises concerns about detecting and preventing misuse of LLMs in tasks such as phishing, disinformation, and academic dishonesty. For instance, many schools banned ChatGPT due to concerns over cheating in assignments, 11 and media outlets have raised the alarm over fake news generated by LLMs. 14 These concerns about the misuse of LLMs have hindered the NLG application in important domains such as media and education. The ability to accurately detect LLM-generated text is critical for realizing the full potential of NLG while minimizing serious consequences. From the perspective of the end users, LLM-generated text detection could increase trust in NLG systems and encourage adoption. For machine learning system developers and researchers, the detector can aid in tracing generated text and preventing unauthorized use. Given its significance, there has been a growing interest in academia and industry to pursue research on LLM-generated text detection and to deepen our understanding of its underlying mechanisms. Key Insights Existing LLM-generated text detection methods can be generally grouped into two categories: black-box detection and white-box detection. Black-box detection involves using API-level access to interact with and analyze LLM outputs. In contrast, white-box detection grants full access to the LLMs, enabling control over the model's generation behavior to enhance detectability. While black-box detection works at present due to detectable signals left by language models in generated text, it will gradually become less viable as language model capabilities advance and ultimately become infeasible. White-box detection methods are based upon the assumption that the LLM is controlled by the developers and offered as a service to end-users. However, the possibility of developers open-sourcing their LLMs poses a challenge to these detection approaches. While there is a rising discussion on whether LLM-generated text could be properly detected and how this can be done, we provide a comprehensive technical introduction of existing detection methods that can be grouped into two general categories: black-box detection and white-box detection. Black-box detection methods are limited to API-level access to LLMs. They rely on collecting text samples from human and machine sources, respectively, to train a classification model that can be used to discriminate between LLM- and human-generated text. Black-box detectors work well because current LLM-generated text often show linguistic or statistical patterns. However, as LLMs evolve and improve, black-box methods are becoming less effective. An alternative is white-box detection: In this scenario, the detector has full access to the LLMs and can control the model's generation behavior for traceability purposes. In practice, black-box detectors are commonly constructed by external entities, whereas white-box detection is generally carried out by LLM developers. This article is to discuss the timely topic from a data mining and natural language processing perspective. We first outline the black-box detection methods in terms of a data analytic life cycle, including data collection, feature selection, and classification model design. We then delve into more recent advancements in white-box detection methods, such as post-hoc watermarks and inference time watermarks. Finally, we present the limitations and concerns of current detection studies and suggest potential future research avenues. We aim to unleash the potential of powerful LLMs by providing fundamental concepts, algorithms, and case studies for detecting LLM-generated text. Prevalence and Impact Recent advancements in LLMs, such as OpenAI's ChatGPT, have emphasized the potential impacts of this technology on individuals and society. Demonstrated through its performance on challenging tests, such as the MBA exams at Wharton Business School, 31 the capabilities of ChatGPT suggest its potential to provide professional assistance across various disciplines. In the healthcare domain, for example, the applications of ChatGPT extend far beyond simple enhancements in efficiency. ChatGPT not only optimizes documentation procedures, facilitating the generati -Abstract Truncated-

A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions

The Science of Detecting LLM-Generated Texts

A Survey on Detection of LLMs-Generated Content

Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement

Robust Detection of LLM-Generated Text: A Comparative Analysis

DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios

LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection

Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey

Unveiling Large Language Models Generated Texts: A Multi-Level Fine-Grained Detection Framework

A Survey of Text Watermarking in the Era of Large Language Models

Detecting Multimedia Generated by Large AI Models: A Survey

MAGE: Machine-generated Text Detection in the Wild

A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization

LLM-Detector: Improving AI-Generated Chinese Text Detection with Open-Source LLM Instruction Tuning

Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated

The Science of Detecting LLM-Generated Text

Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection Via Querying ChatGPT.

DALD: Improving Logits-based Detector without Logits from Black-box LLMs

Improving Logits-based Detector without Logits from Black-box LLMs

On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey