Ldsfnet: lightweight dynamic selection fusion network for face forgery detection
Wen, Shengcong,Qi, Yongfeng
DOI: https://doi.org/10.1007/s11760-024-03692-2
IF: 1.583
2024-12-10
Signal Image and Video Processing
Abstract:Due to the serious security issues caused by face manipulation technology, face forgery detection has received widespread attention. Although existing detection models have achieved impressive results, they still struggle to find the proper balance between detection accuracy and model complexity. To solve this problem, we propose a lightweight dynamic selection fusion network (LDSFNet) to achieve a highly accurate lightweight face forgery detection model. Specifically, we design a two-branch network to capture subtle artifacts in spatial texture features and high-frequency noise features. Firstly, for the spatial texture capture branch, we design a texture feature enhancement (TFE) module, which facilitates the detection performance of the network by extracting the texture difference information between the global texture features and the local texture features, and also introduce a spatial group-wise enhance (SGE) module in the backbone network in order to enhance the forgery traces in the spatial features. Secondly, for the high-frequency noise capture branch, we utilize a learnable steganalysis rich model (SRM) filter to capture the noise inconsistency information in the forged faces, after which we mine and amplify the forged clues through the parameter-free attention (SimAM) module. Finally, we design a dynamic selection fusion (DSF) module to fully fuse spatial texture features and high-frequency noise features, and adaptively select spatial-frequency features to generate feature representations with strong discriminative power. Extensive experiments show that our proposed model outperforms previous work on multiple benchmark dataset.
engineering, electrical & electronic,imaging science & photographic technology