Abstract:Video forgery (VF) is manipulating fake videos by modifying, coordinating or generating new content in the video sequence. The generated fake videos pose a great threat to social stability, enhancing the necessity to introduce fast, efficient video forgery identification techniques. Many existing studies reported effective techniques to detect forged videos at minimal complexity. However, existing techniques failed to obtain global assessments of the entire video frames and are less robust against fast-moving objects. Thus, this article proposes a novel optimized trident encoder-decoder network with adaptive deep learning models for detecting and localizing video forgeries. Initially, the input videos are converted into multiple frames and forwarded to the encoder part of the detection network. In this encoder part, an Attention SqueezeNet (AttSNet) is proposed for obtaining three branches of frames: pristine, forgery and both (pristine and forgery). Then, the bi-directional long short-term memory (Bi-LSTM) model is introduced in the decoder part to accurately detect the presence/absence of forgery from the given video frames. After detection, the forged region is analyzed by proposing a novel Adaptive ResNet (A-ResNet) with a generative adversarial network (GAN) model in the localization process. Here, the features from forgery-detected video frames are extracted by employing A-ResNet and the forged regions are localized effectively by the GAN method. In addition, this study proposes a hybridized wild-hunt (WiH) optimizer technique to update the weight parameters of the proposed model. The proposed method is implemented in the Python platform, and the whole experiment is processed with a face forensic database. In the experimental section, the accuracy of 95.4% and 96.1% and the time complexity of 1.43 s are obtained for detection and localization, respectively.

DEEP-STA: Deep Learning-Based Detection and Localization of Various Types of Inter-Frame Video Tampering Using Spatiotemporal Analysis

Detecting Image Tampering Using Feature Fusion

Two–Stage Detection and Localization of Inter–Frame Tampering in Surveillance Videos Using Texture and Optical Flow

Video tampering localisation using features learned from authentic content

Video Inter-frame Tampering Detection Based on SN-VGG+BiLSTM-AE Composite Model

Spatiotemporal Trident Networks: Detection and Localization of Object Removal Tampering in Video Passive Forensics

UVL2: A Unified Framework for Video Tampering Localization

Detection and localization of multiple inter-frame forgeries in digital videos

Inter-frame Passive-Blind Forgery Detection for Video Shot Based on Similarity Analysis

Real-Time Video Forgery Detection Via Vision-WiFi Silhouette Correspondence

Contour-assistance-based video matting localization

Deep Localization of Mixed Image Tampering Techniques

Exposing video surveillance object forgery by combining TSF features and attention-based deep neural networks

Digital Video Tampering Detection and Localization: Review, Representations, Challenges and Algorithm

Exposing Digital Forgeries in Video via Motion Vectors

Spatio-temporal Features for Generalized Detection of Deepfake Videos

A defensive framework for deepfake detection under adversarial settings using temporal and spatial features

SRTNet: a spatial and residual based two-stream neural network for deepfakes detection

Video forgery detection and localization using optimized attention squeezenet adversarial network

Detecting Violence in Video Based on Deep Features Fusion Technique

A Light Weight Depthwise Separable Layer Optimized CNN Architecture for Object-Based Forgery Detection in Surveillance Videos