TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques

Ashok Urlana,Aditya Saibewar,Bala Mallikarjunarao Garlapati,Charaka Vinayak Kumar,Ajeet Kumar Singh,Srinivasa Rao Chalamala
2024-03-25
Abstract:The Large Language Models (LLMs) exhibit remarkable ability to generate fluent content across a wide spectrum of user queries. However, this capability has raised concerns regarding misinformation and personal information leakage. In this paper, we present our methods for the SemEval2024 Task8, aiming to detect machine-generated text across various domains in both mono-lingual and multi-lingual contexts. Our study comprehensively analyzes various methods to detect machine-generated text, including statistical, neural, and pre-trained model approaches. We also detail our experimental setup and perform a in-depth error analysis to evaluate the effectiveness of these methods. Our methods obtain an accuracy of 86.9\% on the test set of subtask-A mono and 83.7\% for subtask-B. Furthermore, we also highlight the challenges and essential factors for consideration in future studies.
Computer Science
What problem does this paper attempt to address?
This paper explores how to detect machine-generated text in multiple domains and languages, aiming to address issues such as false information and personal information security caused by the widespread use of large-scale language models (LLMs). The paper participated in the SemEval-2024 Task 8 competition and proposed multiple methods including statistical, neural network, and pre-training models, which were comprehensively analyzed and experimented to distinguish between human writing and machine-generated text. The study found that although some models performed well on the development set, they had weak generalization ability on the test set, and proposed the challenges and key factors for future research.