Transformer and Hybrid Deep Learning Based Models for Machine-Generated Text Detection

Teodor-George Marchitan,Claudiu Creanga,Liviu P. Dinu

2024-05-28

Abstract:This paper describes the approach of the UniBuc - NLP team in tackling the SemEval 2024 Task 8: Multigenerator, Multidomain, and Multilingual Black-Box Machine-Generated Text Detection. We explored transformer-based and hybrid deep learning architectures. For subtask B, our transformer-based model achieved a strong \textbf{second-place} out of $77$ teams with an accuracy of \textbf{86.95\%}, demonstrating the architecture's suitability for this task. However, our models showed overfitting in subtask A which could potentially be fixed with less fine-tunning and increasing maximum sequence length. For subtask C (token-level classification), our hybrid model overfit during training, hindering its ability to detect transitions between human and machine-generated text.

Computation and Language

What problem does this paper attempt to address?

The paper aims to address the issue of distinguishing between human-generated text and AI-generated text. Specifically, the research team (UniBuc-NLP) participated in SemEval 2024 Task 8, which is a multi-generator, multi-domain, and multi-language black-box machine-generated text detection challenge. By developing tools capable of identifying the differences between these two types of text, it is possible to maintain the authenticity and integrity of information, prevent the spread of misinformation, and ensure the traceability of content sources. This is crucial for combating unethical AI uses such as propaganda, misinformation, deepfakes, and social manipulation. The research team employed Transformer-based and hybrid deep learning architectures to tackle different subtasks. For Subtask B, their Transformer-based model achieved 2nd place with an accuracy of 86.95%, demonstrating the suitability of this architecture for the task. However, in Subtask A, the model experienced overfitting; and in Subtask C (token-level classification), the hybrid model also encountered overfitting during training, which affected its ability to detect transitions between human and machine-generated text.

Transformer and Hybrid Deep Learning Based Models for Machine-Generated Text Detection

MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection

HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?

Fine-tuning Large Language Models for Multigenerator, Multidomain, and Multilingual Machine-Generated Text Detection

Sharif-MGTD at SemEval-2024 Task 8: A Transformer-Based Approach to Detect Machine Generated Text

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection

DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for Detecting Boundaries of Machine-Generated Texts

AISPACE at SemEval-2024 task 8: A Class-balanced Soft-voting System for Detecting Multi-generator Machine-generated Text

UPB at IberLEF-2023 AuTexTification: Detection of Machine-Generated Text using Transformer Ensembles

Stacking the Odds: Transformer-Based Ensemble for AI-Generated Text Detection

KInIT at SemEval-2024 Task 8: Fine-tuned LLMs for Multilingual Machine-Generated Text Detection

Mast Kalandar at SemEval-2024 Task 8: On the Trail of Textual Origins: RoBERTa-BiLSTM Approach to Detect AI-Generated Text

TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques

Transformer-based approaches to Sentiment Detection

A Simple yet Efficient Ensemble Approach for AI-generated Text Detection

MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark

Robust AI-Generated Text Detection by Restricted Embeddings

A Comprehensive Analysis of Transformer-Deep Neural Network Models in Twitter Disaster Detection

Understanding Transformers for Bot Detection in Twitter

Enhancing Text Authenticity: A Novel Hybrid Approach for AI-Generated Text Detection

Extreme Multi-Domain, Multi-Task Learning With Unified Text-to-Text Transfer Transformers