Predicting Future Instance Segmentation with Contextual Pyramid ConvLSTMs

Jiangxin Sun, Jiafeng Xie, Jian-fang Hu, Zihang Lin, Jianhuang Lai, Wenjun Zeng, Wei-shi Zheng
DOI: https://doi.org/10.1145/3343031.3350949
2019-01-01
Abstract: Despite the remarkable progress in instance segmentation, predicting future instance segmentation remains challenging because future frames are unobserved. Existing methods mainly address this challenge by forecasting pyramid features to represent the unobserved future frames. However, they typically predict the features of each pyramid level independently and ignore the underlying structural relationships between features of different levels. In this paper, we propose a novel framework called Contextual Pyramid ConvLSTMs, which contains a set of ConvLSTMs that exploit intra-level spatio-temporal context to predict the features of each individual level. We also add pathway connections among the ConvLSTMs to transmit information between them, which allows our system to capture more inter-level spatio-temporal contextual information. Experiments show that the proposed method achieves state-of-the-art performance on two video instance segmentation benchmarks for future instance segmentation prediction.