Abstract:Background subtraction is a fundamental task in computer vision with numerous real-world applications, ranging from object tracking to video surveillance. Dynamic backgrounds poses a significant challenge here. Supervised deep learning-based techniques are currently considered state-of-the-art for this task. However, these methods require pixel-wise ground-truth labels, which can be time-consuming and expensive. In this work, we propose a weakly supervised framework that can perform background subtraction without requiring per-pixel ground-truth labels. Our framework is trained on a moving object-free sequence of images and comprises two networks. The first network is an autoencoder that generates background images and prepares dynamic background images for training the second network. The dynamic background images are obtained by thresholding the background-subtracted images. The second network is a U-Net that uses the same object-free video for training and the dynamic background images as pixel-wise ground-truth labels. During the test phase, the input images are processed by the autoencoder and U-Net, which generate background and dynamic background images, respectively. The dynamic background image helps remove dynamic motion from the background-subtracted image, enabling us to obtain a foreground image that is free of dynamic artifacts. To demonstrate the effectiveness of our method, we conducted experiments on selected categories of the CDnet 2014 dataset and the I2R dataset. Our method outperformed all top-ranked unsupervised methods. We also achieved better results than one of the two existing weakly supervised methods, and our performance was similar to the other. Our proposed method is online, real-time, efficient, and requires minimal frame-level annotation, making it suitable for a wide range of real-world applications.

Background subtraction for video sequence using deep neural network

Background Subtraction Based on Nonparametric Bayesian Estimation.

A Universal Foreground Segmentation Technique using Deep-Neural Network

Deep Neural Network Concepts for Background Subtraction: A Systematic Review and Comparative Evaluation

Foreground Gating and Background Refining Network for Surveillance Object Detection

Weakly Supervised Realtime Dynamic Background Subtraction

Background Subtraction Based on Modified Pulse Coupled Neural Network in Compressive Domain

Background Subtraction with Dynamic Noise Sampling and Complementary Learning

Dynamic Background Learning Through Deep Auto-Encoder Networks

Background Subtraction Using Dual-Class Backgrounds

Adaptive Difference Modelling for Background Subtraction.

Motion-Based Background Subtraction

Background Subtraction Using Incremental Subspace Learning

Detecting Moving Objects from Dynamic Background Combining Subspace Learning with Mixed Norm Approach

Selective Eigenbackground for Background Modeling and Subtraction in Crowded Scenes

Complex Background Subtraction by Pursuing Dynamic Spatio-Temporal Models

A Multilayer-Based Framework for Online Background Subtraction with Freely Moving Cameras

Background subtraction in real applications: Challenges, current models and future directions

Dynamic Background Estimation and Complementary Learning for Pixel-Wise Foreground/background Segmentation

Background subtraction in dynamic scenes with adaptive spatial fusing.

DeepPBM: Deep Probabilistic Background Model Estimation from Video Sequences