Optical Flow-Guided Mask Generation Network For Video Segmentation

Yunyi Li,Fangping Chen,Fan Yang,Cong Ma,Yuan Li,Huizhu Jia,Xiaodong Xie
DOI: https://doi.org/10.1109/iscas45731.2020.9181244
2020-01-01
Abstract:The purpose of video segmentation is to segment foreground objects from a video sequence. In this paper, we propose a CNN based method for the semi-supervised video object segmentation, where a hybrid encoder-decoder network is designed to generate pixel-wise foreground object segmentation in use of both spatial and temporal information. In order to minimize cumulative error of the network as much as possible, we develop a two-stage training scheme: alternate training and back-propagation-through-time training. Then the performances of our method and other state-of-the-art ones are compared on two annotated video segmentation databases. Furthermore, we also run an extensive ablation study to test the effects of different components from our method.
What problem does this paper attempt to address?