Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

Kathleen C. Fraser,Hillary Dawkins,Svetlana Kiritchenko
2024-06-22
Abstract:Large language models (LLMs) have advanced to a point that even humans have difficulty discerning whether a text was generated by another human, or by a computer. However, knowing whether a text was produced by human or artificial intelligence (AI) is important to determining its trustworthiness, and has applications in many domains including detecting fraud and academic dishonesty, as well as combating the spread of misinformation and political propaganda. The task of AI-generated text (AIGT) detection is therefore both very challenging, and highly critical. In this survey, we summarize state-of-the art approaches to AIGT detection, including watermarking, statistical and stylistic analysis, and machine learning classification. We also provide information about existing datasets for this task. Synthesizing the research findings, we aim to provide insight into the salient factors that combine to determine how "detectable" AIGT text is under different scenarios, and to make practical recommendations for future work towards this significant technical and societal challenge.
Computation and Language,Computers and Society
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper primarily explores the issue of detecting AI-generated text (AIGT) and provides a review and analysis of the current research in this field. Specifically: 1. **Background and Importance**: - With the development of large language models (LLMs), it is becoming increasingly difficult to distinguish whether text is generated by humans or computers. - Determining the source of text is crucial for assessing its credibility, especially in detecting fraud, academic misconduct, and combating the spread of misinformation. 2. **Research Objectives**: - Summarize the current state-of-the-art AIGT detection methods, including watermarking techniques, statistical and stylistic analysis, and machine learning classification. - Provide information on existing datasets and synthesize research findings to reveal key factors affecting the detectability of AIGT. - Offer practical recommendations for future work to address this significant technical and social challenge. 3. **Main Content**: - **Task Definition**: Clarify the task of AIGT detection and discuss its key characteristics. - **Classification**: Categorize AIGT into different types ranging from fully automated to highly human-intervened. - **Detection Scenarios**: Describe different types of detection scenarios and their differences. - **Method Overview**: Introduce current NLP methods, divided into watermarking techniques, statistical and stylistic analysis, and pre-trained language model classification. - **Datasets**: List existing datasets available for training and testing AIGT detection systems. - **Influencing Factors**: Discuss various factors affecting the difficulty of AIGT detection, such as characteristics of the generation model, text length, adversarial strategies, etc. - **Conclusions and Recommendations**: Summarize research findings and propose suggestions for future research directions. Through this content, the paper aims to provide researchers and technical practitioners with a comprehensive guide to help them choose the most appropriate detection methods and training datasets for specific applications. As LLMs become more prevalent in daily life, AIGT detection will become an important issue that requires collaborative efforts to solve.