Abstract:Image forgery detection aims to detect and locate forged regions in an image. Most existing forgery detection algorithms formulate classification problems to classify pixels into forged or pristine. However, the definition of forged and pristine pixels is only relative within one single image, e.g., a forged region in image A is actually a pristine one in its source image B (splicing forgery). Such a relative definition has been severely overlooked by existing methods, which unnecessarily mix forged (pristine) regions across different images into the same category. To resolve this dilemma, we propose the FOrensic ContrAstive cLustering (FOCAL) method, a novel, simple yet very effective paradigm based on contrastive learning and unsupervised clustering for the image forgery detection. Specifically, FOCAL 1) utilizes pixel-level contrastive learning to supervise the high-level forensic feature extraction in an image-by-image manner, explicitly reflecting the above relative definition; 2) employs an on-the-fly unsupervised clustering algorithm (instead of a trained one) to cluster the learned features into forged/pristine categories, further suppressing the cross-image influence from training data; and 3) allows to further boost the detection performance via simple feature-level concatenation without the need of retraining. Extensive experimental results over six public testing datasets demonstrate that our proposed FOCAL significantly outperforms the state-of-the-art competing algorithms by big margins: +24.3% on Coverage, +18.6% on Columbia, +17.5% on FF++, +14.2% on MISD, +13.5% on CASIA and +10.3% on NIST in terms of IoU. The paradigm of FOCAL could bring fresh insights and serve as a novel benchmark for the image forgery detection task. The code is available at <a class="link-external link-https" href="https://github.com/HighwayWu/FOCAL" rel="external noopener nofollow">this https URL</a>.

Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization

Hierarchical Fine-Grained Image Forgery Detection and Localization

Image Forgery Localization via Guided Noise and Multi-Scale Feature Aggregation

AdaIFL: Adaptive Image Forgery Localization Via a Dynamic and Importance-Aware Transformer Network

ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization

End-to-end Image Splicing Localization Based on Multi-Scale Features and Residual Refinement Module

Noise-assisted Prompt Learning for Image Forgery Detection and Localization

DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention

CECL-Net: Contrastive Learning and Edge-Reconstruction-Driven Complementary Learning Network for Image Forgery Localization

Hierarchical Forgery Classifier On Multi-modality Face Forgery Clues

Unified Video and Image Representation for Boosted Video Face Forgery Detection

FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models

AGIL-SwinT: Attention-guided Inconsistency Learning for Face Forgery Detection

Attention-guided Fine-grained Feature Learning For Robust Face Forgery Detection

DMFF-Net: Double-stream multilevel feature fusion network for image forgery localization

Image Forgery Detection with Interpretability

Collaborative Feature Learning for Fine-grained Facial Forgery Detection and Segmentation

Rethinking Image Forgery Detection via Contrastive Learning and Unsupervised Clustering

RIFD-Net: A Robust Image Forgery Detection Network

Learning Discriminative Noise Guidance for Image Forgery Detection and Localization

Hierarchical Frequency-Assisted Interactive Networks for Face Manipulation Detection