Abstract:Natural scenes contain a wide range of textured motion phenomena which are characterized by the movement of a large amount of particle and wave elements, such as falling snow, wavy water, and dancing grass. In this paper, we present a generative model for representing these motion patterns and study a Markov chain Monte Carlo algorithm for inferring the generative representation from observed video sequences. Our generative model consists of three components. The first is a photometric model which represents an image as a linear superposition of image bases selected from a generic and overcomplete dictionary. The dictionary contains Gabor and LoG bases for point/particle elements and Fourier bases for wave elements. These bases compete to explain the input images and transfer them to a token (base) representation with an O(10(2))-fold dimension reduction. The second component is a geometric model which groups spatially adjacent tokens (bases) and their motion trajectories into a number of moving elements--called "motons." A moton is a deformable template in time-space representing a moving element, such as a falling snowflake or a flying bird. The third component is a dynamic model which characterizes the motion of particles, waves, and their interactions. For example, the motion of particle objects floating in a river, such as leaves and balls, should be coupled with the motion of waves. The trajectories of these moving elements are represented by coupled Markov chains. The dynamic model also includes probabilistic representations for the birth/death (source/sink) of the motons. We adopt a stochastic gradient algorithm for learning and inference. Given an input video sequence, the algorithm iterates two steps: 1) computing the motons and their trajectories by a number of reversible Markov chain jumps, and 2) learning the parameters that govern the geometric deformations and motion dynamics. Novel video sequences are synthesized from the learned models and, by editing the model parameters, we demonstrate the controllability of the generative model.

Generative Image Dynamics

Automated Video Looping with Progressive Dynamism

DTVNet: Dynamic Time-Lapse Video Generation via Single Still Image

Modeling Textured Motion : Particle, Wave and Sketch.

A Generative Method for Textured Motion: Analysis and Synthesis

Analysis and synthesis of textured motion: particles and waves.

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Modeling Complex Motion by Tracking and Editing Hidden Markov Graphs.

Analysis and Synthesis of Textured Motion: Particle, Wave and Cartoon Sketch

PhysMotion: Physics-Grounded Dynamics From a Single Image

Motion Prompting: Controlling Video Generation with Motion Trajectories

Motion-Based Generator Model: Unsupervised Disentanglement of Appearance, Trackable and Intrackable Motions in Dynamic Patterns.

Compressing Scene Dynamics: A Generative Approach

Animate124: Animating One Image to 4D Dynamic Scene

Learning In-between Imagery Dynamics via Physical Latent Spaces

Modeling Complex Motion: Photometric, Geometric, Dynamic, and Topological Aspects

Dynamic texture synthesis using a spatial temporal descriptor

Controllable Longer Image Animation with Diffusion Models

Animate Your Motion: Turning Still Images into Dynamic Videos

Dynamical Textures Modeling Via Joint Video Dictionary Learning

Motion Modes: What Could Happen Next?