Abstract: With the significant advances in deep generative models for image and video synthesis, deepfakes and manipulated media have raised severe societal concerns. Conventional machine learning classifiers for deepfake detection often fail to cope with evolving deepfake generation technology and are susceptible to adversarial attacks. Alternatively, invisible image watermarking is being researched as a proactive defense technique that allows media authentication by verifying an invisible secret message embedded in the image pixels. A handful of invisible image watermarking techniques introduced for media authentication have proven vulnerable to basic image processing operations and watermark removal attacks. In response, we propose a semi-fragile image watermarking technique that embeds an invisible secret message into real images for media authentication. The proposed framework is designed to be fragile to facial manipulations or tampering while being robust to benign image-processing operations and watermark removal attacks. This is achieved through an architecture that combines the backbone encoder-decoder and discriminator networks with critic and adversarial networks, which enforce high image quality and resilience to watermark removal efforts, respectively. Thorough experimental investigations on SOTA facial deepfake datasets demonstrate that our model can embed a $64$-bit secret as an imperceptible image watermark that is recovered with high bit-recovery accuracy when benign image processing operations are applied, yet is non-recoverable when unseen deepfake manipulations are applied. In addition, the proposed watermarking technique demonstrates high resilience to several white-box and black-box watermark removal attacks, thus obtaining state-of-the-art performance.
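
The verification step implied by the abstract can be illustrated with a minimal sketch: compare the embedded $64$-bit secret with the bits a decoder returns and accept the image only if the bit-recovery accuracy is high. The function names, the 0.9 threshold, and the simulated bit flips below are assumptions made for illustration; they are not taken from the paper and do not model its encoder, decoder, or reported results.

```python
import random


def bit_recovery_accuracy(embedded, recovered):
    """Fraction of bit positions where the recovered message matches the embedded one."""
    if len(embedded) != len(recovered):
        raise ValueError("messages must have the same length")
    matches = sum(1 for a, b in zip(embedded, recovered) if a == b)
    return matches / len(embedded)


def is_authentic(embedded, recovered, threshold=0.9):
    """Accept the image if enough bits survive.

    The 0.9 threshold is a hypothetical value, not one stated in the abstract.
    """
    return bit_recovery_accuracy(embedded, recovered) >= threshold


# A 64-bit secret, generated with a fixed seed so the example is repeatable.
rng = random.Random(0)
secret = [rng.randint(0, 1) for _ in range(64)]

# Stand-in for a benign operation: assume only a few bits are decoded wrongly.
benign = list(secret)
for i in (3, 17, 42):
    benign[i] ^= 1

# Stand-in for a facial manipulation: assume half of the bits are decoded wrongly,
# which is what an uninformative decoder output would give on average.
tampered = [b ^ 1 if i % 2 == 0 else b for i, b in enumerate(secret)]

print(bit_recovery_accuracy(secret, benign), is_authentic(secret, benign))
print(bit_recovery_accuracy(secret, tampered), is_authentic(secret, tampered))
```

Under these assumptions, the semi-fragile behavior shows up as a gap between the two accuracies: the benign case stays above the threshold, while the manipulated case falls to chance level and fails verification.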

AVSecure: an Audio-Visual Watermarking Framework for Proactive Deepfake Detection

AVoiD-DF: Audio-Visual Joint Learning for Detecting Deepfake

Are Watermarks Bugs for Deepfake Detectors? Rethinking Proactive Forensics

AVForensics: Audio-driven Deepfake Video Detection with Masking Strategy in Self-supervision

AVT2-DWF: Improving Deepfake Detection with Audio-Visual Fusion and Dynamic Weighting Strategies

V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

Audio-Visual Temporal Forgery Detection Using Embedding-Level Fusion and Multi-Dimensional Contrastive Loss

A Unified Framework for Modality-Agnostic Deepfakes Detection

Robust Identity Perceptual Watermark Against Deepfake Face Swapping

Audio-visual Deepfake Detection Using Articulatory Representation Learning

Facial Features Matter: a Dynamic Watermark based Proactive Deepfake Detection Approach

SepMark: Deep Separable Watermarking for Unified Source Tracing and Deepfake Detection

Unified Video and Image Representation for Boosted Video Face Forgery Detection

AI-assisted deepfake detection using adaptive blind image watermarking

AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection

Social Media Authentication and Combating Deepfakes using Semi-fragile Invisible Image Watermarking

Pluggable Watermarking of Deepfake Models for Deepfake Detection

AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection

SafeEar: Content Privacy-Preserving Audio Deepfake Detection

Robust Blind Video Watermarking with Adaptive Embedding Mechanism

Audio Deepfake Detection with Self-Supervised WavLM and Multi-Fusion Attentive Classifier