Video Anomaly Detection Via Successive Image Frame Prediction Leveraging Optical Flows

Hongyong Wang, Xinjian Zhang, Su Yang, Weishan Zhang, Jie Li, Haiyun Wang
DOI: https://doi.org/10.1109/prai55851.2022.9904101
2022-01-01
Abstract: Video anomaly detection is challenging due to the diversity of abnormal events, which has made unsupervised learning the most promising solution in recent work. Within this framework, existing methods assume that anomalies cannot be reconstructed or predicted from historical data as accurately as normal events, so the reconstruction or prediction error can serve as an indicator of anomalies. In this study, we propose to discriminate anomalies from normal events by fusing appearance and motion in a frame prediction framework. The novelty is that we embed optical flows into the frame prediction framework as the cue that directs the transformation from the input frame to the predicted frame, which makes appearance-motion fusion natural and requires no extra effort to align the two. The error of predicting the next frame from the concatenation of the appearance and the associated motion of the present frame then indicates the anomaly score. Notably, we compute optical flows from a single frame instead of using the traditional computation over two consecutive frames. The goal is to make the optical flows depend on the whole training data, so that anomalies deviating markedly from the training data result in highly distorted optical flows and, correspondingly, relatively high prediction error. Traditional optical flows, which are rooted in the differential of two consecutive frames, do not guarantee this. In summary, we extend appearance-motion correspondence learning to motion-guided prediction that ties together the appearances of two consecutive frames. We also introduce a margin loss to enhance the learning of frame prediction. Experiments on widely accepted benchmarks demonstrate the state-of-the-art performance of our approach.
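The core idea that prediction error indicates the anomaly score can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the error metric or any normalization, so the use of mean squared error, the min-max normalization, the function names `mse` and `anomaly_scores`, and the toy pixel values are all assumptions made for illustration.

```python
from typing import List, Sequence


def mse(predicted: Sequence[float], actual: Sequence[float]) -> float:
    """Mean squared error between two flattened frames of equal length.

    Assumption: MSE stands in for whatever prediction error the paper uses.
    """
    if len(predicted) != len(actual):
        raise ValueError("frames must have the same number of pixels")
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)


def anomaly_scores(errors: Sequence[float]) -> List[float]:
    """Min-max normalize per-frame prediction errors to the range [0, 1].

    A higher score means a larger prediction error, i.e. a frame that the
    predictor (trained on normal data) could not anticipate well.
    """
    lo, hi = min(errors), max(errors)
    if hi == lo:
        # All frames were predicted equally well; no frame stands out.
        return [0.0 for _ in errors]
    return [(e - lo) / (hi - lo) for e in errors]


# Toy data (hypothetical values): three "frames" of four pixels each.
actual = [[0.1, 0.2, 0.3, 0.4], [0.2, 0.3, 0.4, 0.5], [0.9, 0.1, 0.8, 0.0]]
predicted = [[0.1, 0.2, 0.3, 0.4], [0.2, 0.3, 0.4, 0.6], [0.3, 0.4, 0.5, 0.6]]

errors = [mse(p, a) for p, a in zip(predicted, actual)]
scores = anomaly_scores(errors)
```

In this toy example the third frame is predicted worst and therefore receives the highest score. In practice the predicted frames would come from a trained frame-prediction network rather than hand-written lists.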