RU-AI: A Large Multimodal Dataset for Machine Generated Content Detection

Liting Huang,Zhihao Zhang,Yiran Zhang,Xiyue Zhou,Shoujin Wang

2024-06-07

Abstract:The recent advancements in generative AI models, which can create realistic and human-like content, are significantly transforming how people communicate, create, and work. While the appropriate use of generative AI models can benefit the society, their misuse poses significant threats to data reliability and authentication. However, due to a lack of aligned multimodal datasets, effective and robust methods for detecting machine-generated content are still in the early stages of development. In this paper, we introduce RU-AI, a new large-scale multimodal dataset designed for the robust and efficient detection of machine-generated content in text, image, and voice. Our dataset is constructed from three large publicly available datasets: Flickr8K, COCO, and Places205, by combining the original datasets and their corresponding machine-generated pairs. Additionally, experimental results show that our proposed unified model, which incorporates a multimodal embedding module with a multilayer perceptron network, can effectively determine the origin of the data (i.e., original data samples or machine-generated ones) from RU-AI. However, future work is still required to address the remaining challenges posed by RU-AI. The source code and dataset are available at <a class="link-external link-https" href="https://github.com/ZhihaoZhang97/RU-AI" rel="external noopener nofollow">this https URL</a>.

Computer Vision and Pattern Recognition,Artificial Intelligence

What problem does this paper attempt to address?

The problem this paper attempts to address is how to effectively detect content generated by generative AI models. With the development of generative AI models, these models are capable of creating realistic and human-like content, which has significantly changed the way people communicate, create, and work. However, the misuse of these models poses a significant threat to the reliability and authenticity of data. Due to the lack of aligned multimodal datasets, effective and robust methods for detecting machine-generated content are still in the early stages of development. Therefore, this paper introduces the RU-AI dataset, a large-scale multimodal dataset aimed at robust and efficient detection of machine-generated content through various forms such as text, images, and speech. Additionally, experimental results show that the proposed unified model (which combines a multimodal embedding module of multilayer perceptron networks) can effectively identify the data sources in RU-AI (i.e., original data samples or machine-generated data). However, future work still needs to address the remaining challenges posed by RU-AI.

RU-AI: A Large Multimodal Dataset for Machine Generated Content Detection

MiRAGeNews: Multimodal Realistic AI-Generated News Detection

AiGen-FoodReview: A Multimodal Dataset of Machine-Generated Restaurant Reviews and Images on Social Media

Detecting Multimedia Generated by Large AI Models: A Survey

GenImage: A Million-Scale Benchmark for Detecting AI-Generated Image

Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features

AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark

Improving Interpretability and Robustness for the Detection of AI-Generated Images

SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model

A Sanity Check for AI-generated Image Detection

Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts

DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark

The Adversarial AI-Art: Understanding, Generation, Detection, and Benchmarking

WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection

Enhancing Text Authenticity: A Novel Hybrid Approach for AI-Generated Text Detection

Introduction to the Special Issue on AI-Generated Content for Multimedia

A Large-scale Universal Evaluation Benchmark For Face Forgery Detection

Harnessing Machine Learning for Discerning AI-Generated Synthetic Images

Community Forensics: Using Thousands of Generators to Train Fake Image Detectors