BEAVP: A Bidirectional Enhanced Adversarial Model for Video Prediction

Peiyuan Zhu,Shengjie Zhao,Fengxia Han,Hao Deng
DOI: https://doi.org/10.1109/fg59268.2024.10581919
2024-01-01
Abstract:Predicting future frames in videos is crucial for motion understanding and behavior analysis. However, despite significant advancements, existing stochastic methods have insufficient utilization of motion patterns, leading to blurry motion in long-term predictions. Most of the previous work also lacks constraints to effectively address the unconstrained nature of spacetime-varying motion. In this paper, we propose a stochastic video prediction model based on coupled GANs. The pair of GANs could model motion trends based on adjacent frames organized in sequential and reverse orders, respectively. We assume a common latent space assumption and build bridges between forward prediction and backward prediction by leveraging the constraints of weight-sharing and cycle- consistency. Specifically, we propose to learn a joint distribution with adjacent frames in opposite orders drawn from the marginal distributions and enhance forward prediction with an in-depth exploration of motion patterns. Through experiments on several challenging datasets that include spacetime-varying human motion, we show that our model surpasses the performance of state-of-the-art models, thus validating the effectiveness of our proposed approach.
What problem does this paper attempt to address?