Video deepfake detection using Particle Swarm Optimization improved deep neural networks

Leandro Cunha,Li Zhang,Bilal Sowan,Chee Peng Lim,Yinghui Kong
DOI: https://doi.org/10.1007/s00521-024-09536-x
2024-02-22
Neural Computing and Applications
Abstract:Abstract As complexity and capabilities of Artificial Intelligence technologies increase, so does its potential for misuse. Deepfake videos are an example. They are created with generative models which produce media that replicates the voices and faces of real people. Deepfake videos may be entertaining, but they may also put privacy and security at risk. A criminal may forge a video of a politician or another notable person in order to affect public opinions or deceive others. Approaches for detecting and protecting against these types of forgery must evolve as well as the methods of generation to ensure that proper information is supplied and to mitigate the risks associated with the fast evolution of deepfakes. This research exploits the effectiveness of deepfake detection algorithms with the application of a Particle Swarm Optimization (PSO) variant for hyperparameter selection. Since Convolutional Neural Networks excel in recognizing objects and patterns in visual data while Recurrent Neural Networks are proficient at handling sequential data, in this research, we propose a hybrid EfficientNet-Gated Recurrent Unit (GRU) network as well as EfficientNet-B0-based transfer learning for video forgery classification. A new PSO algorithm is proposed for hyperparameter search, which incorporates composite leaders and reinforcement learning-based search strategy allocation to mitigate premature convergence. To assess whether an image or a video is manipulated, both models are trained on datasets containing deepfake and genuine photographs and videos. The empirical results indicate that the proposed PSO-based EfficientNet-GRU and EfficientNet-B0 networks outperform the counterparts with manual and optimal learning configurations yielded by other search methods for several deepfake datasets.
computer science, artificial intelligence
What problem does this paper attempt to address?
The problem this paper attempts to address is the detection of Deepfake videos. As the complexity and capabilities of artificial intelligence technology continue to increase, so do the potential risks of its misuse. Deepfake videos replicate the voices and faces of real people through generative models, which, while potentially entertaining, can also pose threats to privacy and security. For example, criminals could forge videos of politicians or other well-known individuals to influence public opinion or deceive others. Therefore, researchers need to develop effective detection methods to ensure the authenticity of information and reduce the risks posed by Deepfakes. To tackle this challenge, the study proposes a Deep Neural Network (DNN) improved by Particle Swarm Optimization (PSO) to detect Deepfake videos. Specifically, the researchers propose a hybrid EfficientNet-Gated Recurrent Unit (GRU) network and a transfer learning model based on EfficientNet-B0 for video forgery classification. Additionally, a new PSO algorithm is proposed for hyperparameter selection to enhance the model's performance. By training these models on datasets containing both Deepfake and real photos and videos, the experimental results show that the proposed PSO-based EfficientNet-GRU and EfficientNet-B0 networks outperform other manual and optimal learning configuration methods on multiple Deepfake datasets.