Abstract: Splicing forgery, which manipulates an image by copying regions from a donor image and pasting them into a host image, is one of the common types of image forgery in practice; the copied regions may be objects or background. The mainstream approach to localizing forged regions is an encoder-decoder network that extracts enough manipulation traces to decide, for each pixel of the input image, whether it has been spliced. However, because the receptive field of such networks is limited, they learn only local manipulation traces, so forgeries involving large objects or background regions are not localized well. To address these issues, this paper proposes an end-to-end splicing detection framework comprising a localization network (L-Net), a manipulation traces attention network (MTA-Net), and an adaptive multi-scale fusion module (AMSFM). L-Net is an encoder-decoder network that extracts local manipulation traces for each pixel and localizes the spliced areas. MTA-Net uses the proposed content-remove convolutional layer (CRCL) to suppress image content information that would hinder the network from learning manipulation traces, and then uses subsequent convolutional layers to extract features for classifying whether the input image is spliced. In this process, the regions of the convolutional feature maps with large activation values are those that contain global manipulation traces. The AMSFM fuses these global manipulation traces with the local manipulation traces learned by L-Net, allowing L-Net to handle object forgeries and background forgeries of various sizes effectively.
Ablation experiments show increases of 4.6% in F1-score and 3.9% in MCC after MTA-Net and AMSFM are introduced. Splicing localization results on three standard datasets, CASIA, COLUMB, and CARVALHO, show that the proposed method outperforms state-of-the-art methods for both object forgery and background forgery, and is more robust to post-processing such as JPEG compression and noise addition.
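
The abstract reports pixel-level F1-score and MCC but does not say how they are computed. The following is a minimal sketch using the standard definitions of both metrics, assuming the predicted and ground-truth masks are equally sized binary grids in which 1 marks a spliced pixel. The function names and the toy masks are illustrative assumptions, not taken from the paper, and the authors' evaluation protocol (thresholding, per-image versus per-dataset averaging) may differ.

```python
import math


def confusion_counts(pred, truth):
    """Count TP, FP, FN, TN over two equally sized binary masks (1 = spliced)."""
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif not p and t:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def f1_score(tp, fp, fn, tn):
    """F1 = 2TP / (2TP + FP + FN); defined here as 0.0 when the denominator is 0."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def mcc(tp, fp, fn, tn):
    """Matthews correlation coefficient; defined here as 0.0 when undefined."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical 2x4 masks, for illustration only.
truth = [[0, 0, 1, 1],
         [0, 0, 1, 1]]
pred = [[0, 1, 1, 1],
        [0, 0, 1, 0]]

counts = confusion_counts(pred, truth)
print("F1 :", f1_score(*counts))
print("MCC:", mcc(*counts))
```

Unlike F1, MCC also rewards correctly identified authentic pixels (true negatives), which may be why the two are reported together when spliced regions vary widely in size.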

Joint Manipulation Trace Attention Network and Adaptive Fusion Mechanism for Image Splicing Forgery Localization
