Double-Stream Segmentation Network with Temporal Self-attention for Deepfake Video Detection

Beibei Liu,Yifei Gao,Yongjian Hu,Jingjing Guo,Yufei Wang
DOI: https://doi.org/10.1007/978-3-030-95398-0_3
2022-01-01
Abstract:As deep learning technology develops rapidly, the adverse impact of counterfeiting with deep learning is expanding. It has become a hot research topic to explore ways of detecting deepfakes, i.e., fake images or videos generated with deep learning methods. The results reported in the literature show that many existing detection methods deteriorate severely in cross-dataset tests. To solve this problem, we propose a novel network that embodies three key features: 1) double-stream feature extraction module to reveal artifacts simultaneously from the original image space and the noise residual space; 2) temporal self-attention module to infer the authenticity of the current frame from the temporal context; 3) tampering region segmentation module that is based on fully convolutional network and predicts the tampered face area in each frame. We also propose a modified IoU (Intersection-over-Union) measurement of the predicted tampering region against the face region, upon which the authenticity of a frame is determined. We conduct comprehensive comparative experiments on five major datasets and demonstrate the superiority of our proposed model, particularly in the challenging cross-dataset settings.
What problem does this paper attempt to address?