DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection

Zhiyuan Yan, Yong Zhang, Xinhang Yuan, Siwei Lyu, Baoyuan Wu

2023-10-28

Abstract: A critical yet frequently overlooked challenge in the field of deepfake detection is the lack of a standardized, unified, comprehensive benchmark. This issue leads to unfair performance comparisons and potentially misleading results. Specifically, there is a lack of uniformity in data processing pipelines, resulting in inconsistent data inputs for detection models. Additionally, there are noticeable differences in experimental settings, and evaluation strategies and metrics lack standardization. To fill this gap, we present the first comprehensive benchmark for deepfake detection, called DeepfakeBench, which offers three key contributions: 1) a unified data management system to ensure consistent input across all detectors, 2) an integrated framework for implementing state-of-the-art methods, and 3) standardized evaluation metrics and protocols to promote transparency and reproducibility. Featuring an extensible, modular codebase, DeepfakeBench contains 15 state-of-the-art detection methods, 9 deepfake datasets, a series of deepfake detection evaluation protocols and analysis tools, as well as comprehensive evaluations. Moreover, we provide new insights based on extensive analysis of these evaluations from various perspectives (e.g., data augmentations, backbones). We hope that our efforts can facilitate future research and foster innovation in this increasingly critical domain. All code, evaluations, and analyses of our benchmark are publicly available at https://github.com/SCLBD/DeepfakeBench.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper addresses the lack of a unified, comprehensive benchmark for deepfake detection. Existing work suffers from inconsistent data processing pipelines, differing experimental settings, and non-standardized evaluation strategies and metrics, which makes performance comparisons unfair and results potentially misleading. To address these problems, the authors propose DeepfakeBench, a benchmarking platform with three key contributions:

1. A unified data management system that ensures consistent input across all detectors.
2. An integrated framework for implementing state-of-the-art detection methods, enabling direct comparison between approaches.
3. Standardized evaluation metrics and protocols that promote transparency and reproducibility.

DeepfakeBench includes 15 state-of-the-art detection methods, 9 deepfake datasets, a range of evaluation protocols and analysis tools, and comprehensive evaluations. The paper also draws new insights from analyzing these evaluations from several perspectives, such as data augmentations and backbones. Its goal is to establish a fair platform for comparing deepfake detection techniques, improving the reproducibility of research and exposing the factors that affect detection performance, thereby supporting technical progress and helping to counter potential misuse.
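The idea behind contributions 1 and 3 can be illustrated with a minimal sketch: every detector receives the same preprocessed samples and is scored by the same metric function. This is not DeepfakeBench's actual code or API. The names `auc`, `benchmark`, `Sample`, and `Detector` are hypothetical, and AUC is assumed as the metric purely for illustration, since the text above does not name a specific one.

```python
from typing import Callable, Dict, Sequence, Tuple

# Hypothetical stand-ins: a "sample" is an already-preprocessed input,
# and a "detector" maps a sample to a fakeness score (higher = more fake).
Sample = Tuple[float, ...]
Detector = Callable[[Sample], float]


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve, computed by pairwise comparison.

    Equals the probability that a fake sample (label 1) scores higher
    than a real sample (label 0); ties count as half a win.
    """
    fake = [s for y, s in zip(labels, scores) if y == 1]
    real = [s for y, s in zip(labels, scores) if y == 0]
    if not fake or not real:
        raise ValueError("need at least one real and one fake sample")
    wins = sum(
        1.0 if f > r else 0.5 if f == r else 0.0
        for f in fake
        for r in real
    )
    return wins / (len(fake) * len(real))


def benchmark(
    detectors: Dict[str, Detector],
    samples: Sequence[Sample],
    labels: Sequence[int],
) -> Dict[str, float]:
    """Score every detector on identical inputs with one shared metric."""
    return {
        name: auc(labels, [detect(x) for x in samples])
        for name, detect in detectors.items()
    }


if __name__ == "__main__":
    # Toy data (made up for this sketch): one feature per sample.
    samples = [(0.1,), (0.9,), (0.4,), (0.8,)]
    labels = [0, 1, 0, 1]
    detectors = {
        "uses_feature": lambda x: x[0],  # separates the toy data perfectly
        "constant": lambda x: 0.5,       # uninformative baseline
    }
    for name, score in benchmark(detectors, samples, labels).items():
        print(f"{name}: AUC = {score:.2f}")
```

Because the samples, labels, and metric are fixed outside the detectors, any difference in the reported numbers comes from the detectors themselves rather than from differing preprocessing or evaluation code.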

Towards Benchmarking and Evaluating Deepfake Detection

DF40: Toward Next-Generation Deepfake Detection

Deepfake Generation and Detection: A Benchmark and Survey

Deepfake: Definitions, Performance Metrics and Standards, Datasets and Benchmarks, and a Meta-Review

Assessment framework for deepfake detection in real-world situations

A Survey on Deepfake Video Detection

DEEPFAKER: A Unified Evaluation Platform for Facial Deepfake and Detection Models

Penny-Wise and Pound-Foolish in Deepfake Detection

WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection

Deepfake Detection: A Comprehensive Study from the Reliability Perspective

Improving Fairness in Deepfake Detection

Deepfake Detection: A Comprehensive Survey from the Reliability Perspective

Deepfake Videos in the Wild: Analysis and Detection

DeepFake-O-Meter v2.0: An Open Platform for DeepFake Detection

VoiceWukong: Benchmarking Deepfake Voice Detection

FineFake: A Knowledge-Enriched Dataset for Fine-Grained Multi-Domain Fake News Detection

SoK: Facial Deepfake Detectors

Not Second Class: The First Class II MHC Crystal Structure

Impact of Video Processing Operations in Deepfake Detection