Real-Time Video Forgery Detection Via Vision-WiFi Silhouette Correspondence

Jianwei Liu, Xinyue Fang, Yike Chen, Jiantao Yuan, Guanding Yu, Jinsong Han
DOI: https://doi.org/10.1109/tmc.2024.3483550
IF: 6.075
2024-01-01
IEEE Transactions on Mobile Computing
Abstract: For security and crime prevention, video surveillance systems have been pervasively deployed in many security-critical scenarios, such as residences, retail stores, and banks. However, these systems can be infiltrated by an adversary who modifies or replaces the video streams, i.e., a video forgery attack. The prevalence of Internet of Things (IoT) devices and the emergence of Deepfake-like techniques further underscore the vulnerability of video surveillance systems to such attacks. To secure existing surveillance systems, in this paper we propose a vision-WiFi cross-modal video forgery detection system, namely WiSil. Leveraging a theoretical model based on the principle of signal propagation, WiSil constructs wave front information of the object in the monitoring area from WiFi signals. With a well-designed deep learning network, WiSil further recovers silhouettes from the wave front information. Based on a Siamese network-based semantic feature extractor, WiSil can eventually determine whether a frame is manipulated by comparing the semantic feature vectors extracted from the video's silhouette with those extracted from the WiFi's silhouette. We enhance the basic version of WiSil [1] by developing a model compression method and a forgery trace localization method. Extensive experiments show that WiSil achieves more than 95% accuracy in detecting tampered frames.
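The final comparison step described in the abstract can be illustrated with a minimal sketch. It assumes the two semantic feature vectors (one from the video silhouette, one from the WiFi silhouette) have already been extracted. The abstract does not specify the comparison metric or decision rule, so the cosine similarity, the 0.8 threshold, and the function names below are hypothetical choices made only for illustration, not WiSil's actual method.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero vector carries no direction; treat it as dissimilar.
        return 0.0
    return dot / (norm_a * norm_b)


def is_frame_forged(
    video_features: Sequence[float],
    wifi_features: Sequence[float],
    threshold: float = 0.8,  # hypothetical value, not taken from the paper
) -> bool:
    """Flag a frame as forged when the two modalities disagree.

    Assumption: low similarity between the video-derived and WiFi-derived
    feature vectors indicates that the video frame was manipulated.
    """
    return cosine_similarity(video_features, wifi_features) < threshold


if __name__ == "__main__":
    # Matching silhouettes yield similar vectors: not flagged.
    print(is_frame_forged([0.9, 0.1, 0.3], [0.8, 0.2, 0.3]))
    # Mismatched silhouettes yield dissimilar vectors: flagged.
    print(is_frame_forged([1.0, 0.0, 0.0], [0.0, 1.0, 0.0]))
```

In a real deployment the threshold would presumably be tuned on labeled genuine and tampered frames rather than fixed in advance.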