Background subtraction for video sequence using deep neural network

Yuan Dai, Long Yang
DOI: https://doi.org/10.1007/s11042-024-18843-3
IF: 2.577
2024-03-14
Multimedia Tools and Applications
Abstract: Background subtraction aims to extract moving objects from a video sequence and is a prerequisite for high-level surveillance video analysis. Developing a robust background subtraction approach is challenging because of dynamic backgrounds, illumination changes, shadows, camera jitter, and similar effects. In this paper, we propose an encoder-decoder deep neural network for moving object detection in video sequences. The encoder is built on VGG-16 and ResNet-50 and extracts hierarchical features from a raw image; these features are more robust than handcrafted ones. The decoder uses a transposed convolutional neural network to map the features to a foreground/background classification result. We also design an adapted focal loss function that balances the loss contributions of positive and negative classes, and of hard and easy samples, in different frames according to their degree of imbalance. The model is evaluated on the CDnet2014 and SBI2015 datasets using only a small number of training frames for various challenging scenes. The experimental results demonstrate that our method outperforms state-of-the-art methods without any post-processing.
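The abstract does not give the form of the adapted focal loss, so the following is only a minimal sketch of the standard binary focal loss, FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t), that such a loss would presumably build on. The function names, the default values of alpha and gamma, and in particular the `frame_alpha` helper (which sets the foreground weight from a frame's background fraction) are assumptions made for illustration and are not taken from the paper.

```python
import math


def focal_loss(probs, labels, alpha=0.25, gamma=2.0, eps=1e-7):
    """Mean standard binary focal loss (not the paper's adapted variant).

    probs  : predicted foreground probabilities, one per pixel
    labels : ground-truth labels, 1 = foreground, 0 = background
    alpha  : weight of the foreground (positive) class
    gamma  : focusing parameter; larger values down-weight easy pixels
    """
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)          # clamp to avoid log(0)
        p_t = p if y == 1 else 1.0 - p           # probability of the true class
        alpha_t = alpha if y == 1 else 1.0 - alpha
        total += -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)
    return total / len(probs)


def frame_alpha(labels):
    """Hypothetical per-frame weighting (an assumption, not from the paper):
    the rarer the foreground in a frame, the larger its weight."""
    foreground_fraction = sum(labels) / len(labels)
    return 1.0 - foreground_fraction


# One frame with a single foreground pixel out of four.
labels = [1, 0, 0, 0]
probs = [0.6, 0.1, 0.2, 0.05]
loss = focal_loss(probs, labels, alpha=frame_alpha(labels))
```

With gamma set to 0 and alpha set to 0.5, the expression reduces to half the ordinary cross-entropy, which makes the role of each factor easy to check: alpha_t handles the class imbalance, and (1 - p_t)^gamma handles the imbalance between hard and easy samples.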
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering