Recurrent Adversarial Video Prediction Network

Meiju Wang, Guoqiang Zhong, Zhaoyang Deng, Kang Zhang, Peng Jiang
DOI: https://doi.org/10.1088/1742-6596/2278/1/012016
2022-01-01
Abstract: Mining the intrinsic information of sequential data to predict future data is a promising research direction. Because sequential data carry temporal features, existing approaches generally adopt recurrent neural networks and their variants for prediction. However, for sequences with complex structure, such as video frame sequences, these approaches are not guaranteed to produce good predictions. To address this issue, we propose a novel architecture, called the recurrent adversarial video prediction network (RAVPN), which not only extracts the temporal and spatial features of video sequences but also optimizes the generator and discriminator with an adversarial strategy. Specifically, we use sliding windows of length t + 1 and set the (t + 1)-th frame as the label of the preceding t frames. The generator takes the first t frames as input and tries to generate the (t + 1)-th frame, while the discriminator distinguishes whether a sample is real or fake, which in turn improves the generator. Experimental results show that RAVPN achieves promising performance on video prediction tasks compared with other deep sequence prediction models.
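The sliding-window construction described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `make_windows`, the use of NumPy, the stride of one frame, and the toy video shape are all assumptions made for illustration. It only shows how each window of t + 1 frames splits into t input frames and one label frame.

```python
import numpy as np

def make_windows(frames, t):
    """Split a video of shape (N, H, W) into (inputs, labels) pairs.

    Hypothetical helper: each window spans t + 1 consecutive frames;
    the first t frames are the input and the (t + 1)-th is the label.
    """
    frames = np.asarray(frames)
    n = frames.shape[0]
    if t < 1 or n < t + 1:
        raise ValueError("need t >= 1 and at least t + 1 frames")
    # Window i covers frames i .. i + t (stride of one frame is an assumption).
    inputs = np.stack([frames[i:i + t] for i in range(n - t)])
    labels = np.stack([frames[i + t] for i in range(n - t)])
    return inputs, labels

# Toy "video": 6 frames of 2 x 2 pixels.
video = np.arange(6 * 2 * 2, dtype=np.float32).reshape(6, 2, 2)
inputs, labels = make_windows(video, t=3)
```

With 6 frames and t = 3, this yields 3 windows: `inputs` has shape (3, 3, 2, 2) and `labels` has shape (3, 2, 2). In the setup the abstract describes, a generator would consume each `inputs[i]` and a discriminator would compare the generated frame with `labels[i]`.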