Fusing Multi-scale Attention and Transformer for Detection and Localization of Image Splicing Forgery

Yanzhi Xu,Jiangbin Zheng,Chenyu Shao
DOI: https://doi.org/10.1007/978-981-97-1417-9_31
2024-01-01
Abstract:Image forgery detection has attracted widespread attention due to the enormous spread of forged images on the internet and social media. Many existing methods lack global contextual information and ignore the interaction between multi-level representations, which are not conducive to localizing the multi-scale tampered regions. To overcome these limitations, we present a novel method to detect splicing forgery of images by utilizing cross-scale interaction attention and multi-level global information. Firstly, we design a cross-scale interactive attention (CSIA) module for aggregating multi-level convolutional features, focusing selectively on task-relevant information. It allows representations learned at different levels to communicate effectively with each other. Secondly, by introducing the transformer layers with dynamic position embedding, the proposed method can capture the multi-level global contextual correlations of the image in a more flexible way. Extensive experiments have shown that our proposed method outperforms state-of-the-art methods. It can effectively detect, and segment multi-scale tampered regions by aggregating multi-scale local and global feature relevance.
What problem does this paper attempt to address?