Adt: anti-deepfake transformer

Ping Wang,Kunlin Liu,Wenbo Zhou,Hang Zhou,Honggu Liu,Weiming Zhang,Nenghai Yu
DOI: https://doi.org/10.1109/ICASSP43922.2022.9746888
2022-01-01
Abstract:Recently almost all the mainstream deepfake detection methods use Convolutional Neural Networks (CNN) as their backbone. However, due to the overreliance on local texture information which is usually determined by forgery methods of training data, these CNN-based methods cannot generalize well to unseen data. To get out of the predicament of prior methods, in this paper, we propose a novel transformer-based framework to model both global and local information and analyze anomalies of face images. In particular, we design attention leading module, multi-forensics module and variant residual connections for deepfake detection, and leverage token-level contrast loss for more detailed supervision. Experiments on almost all popular public deepfake datasets demonstrate that our method achieves state-of-the-art performance in cross-dataset evaluation and comparable performance in intra-dataset evaluation.
What problem does this paper attempt to address?