Face Forgery Detection Based on Fine-grained Clues and Noise Inconsistency

Dengyong Zhang,Ruiyi He,Xin Liao,Feng Li,Jiaxin Chen,Gaobo Yang
DOI: https://doi.org/10.1109/tai.2024.3455311
2024-01-01
IEEE Transactions on Artificial Intelligence
Abstract:Deepfake detection has gained increasingly research attentions in media forensics, and a variety of works have been produced. However, subtle artifacts might be eliminated by compression, and the Convolutional Neural Networks (CNN) based detectors are invalidated for fake face images with compression. In this work, we propose a two-stream network for deepfake detection. We observed that high-frequency noise features and spatial features are inherently complementary to each other. Thus, both spatial features and high-frequency noise features are exploited for face forgery detection. Specifically, we design a Double-Frequency Transformer Module (DFTM) to guide the learning of spatial features from local artifact regions. To effectively fuse spatial features and high-frequency noise features, a Dual Domain Attention Fusion Module (DDAFM) is designed. We also introduce a local relationship constraint loss, which requires only image-level labels, for model training. We evaluate the proposed approach on five large-scale benchmark datasets, and extensive experimental results demonstrate the proposed approach outperforms most SOTA works. Code will be provided at https://github.com/hryyyy/HILIF .
What problem does this paper attempt to address?