TFAE: temporal feature adjustable enhancement for video anomaly detection
Jing Liang,Yuanyuan Wu,Wu Zeng,Yuan Zeng
DOI: https://doi.org/10.1007/s11042-024-19660-4
IF: 2.577
2024-07-20
Multimedia Tools and Applications
Abstract:The objective of video anomaly detection is to detect events in video streams that depart from typical behavior or have potential threats. In this paper, we consider video anomaly detection as a weakly supervised problem and use a Multiple Instance Learning (MIL) paradigm based methodology for research. The accuracy of video anomaly detection can be significantly enhanced, and its credibility can be increased, by making full use of the complex dynamic characteristics present in videos, such as variations in movement speeds and target objects. One of the commonly employed techniques for capturing video dynamics is through modeling temporal feature. However, most existing methods are restricted in their ability to capture temporal information. Therefore, we propose the Temporal Feature Adjustable Enhancement (TFAE) network, which maximizes the utilization of complex video dynamics. The TFAE network comprises two key components: 1) a local branch that captures the local temporal dynamics of videos, and 2) a global branch that aggregates overall feature information to achieve temporal enhancement. Additionally, we introduce a new optimization loss composed of outside bag loss and inside bag loss to effectively separate abnormal and normal videos. The outside bag loss applies smooth mapping to mitigate the impact of gradient vanishing, while the inside bag loss restricts the search space for weakly supervised paradigms. Experimental results showcase that our method attains state-of-the-art performance on UCF-Crime, ShanghaiTech, and UCSD Ped2 datasets.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering