DevsNet: Deep Video Saliency Network Using Short-term and Long-term Cues

Yuming Fang,Chi Zhang,Xiongkuo Min,Hanqin Huang,Yugen Yi,Guangtao Zhai,Chia-Wen Lin
DOI: https://doi.org/10.1016/j.patcog.2020.107294
IF: 8
2020-01-01
Pattern Recognition
Abstract:•We design a novel video saliency detection model by design the new 3-D ConvNet and B-ConvLSTM to extract short-term and long-term spatiotemporal cues, respectively. Through combining short-term and long-term spatiotemporal features, the proposed model can obtain promising performance for video saliency prediction.•We design a new two-layer B-ConvLSTM structure for long-term spatiotemporal feature extraction for video saliency detection. The proposed B-ConvLSTM can extract the temporal information not just from the previous video frames but also from the next frames, which demonstrates that the proposed network takes both the forward and backward temporal features into account.
What problem does this paper attempt to address?