Abstract:Recent generative methods have revolutionized the way of human motion synthesis, such as Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), and Denoising Diffusion Probabilistic Models (DMs). These methods have gained significant attention in human motion fields. However, there are still challenges in unconditionally generating highly diverse human motions from a given distribution. To enhance the diversity of synthesized human motions, previous methods usually employ deep neural networks (DNNs) to train a transport map that transforms Gaussian noise distribution into real human motion distribution. According to Figalli's regularity theory, the optimal transport map computed by DNNs frequently exhibits discontinuities. This is due to the inherent limitation of DNNs in representing only continuous maps. Consequently, the generated human motions tend to heavily concentrate on densely populated regions of the data distribution, resulting in mode collapse or mode mixture. To address the issues, we propose an efficient method called MOOT for unconditional human motion synthesis. First, we utilize a reconstruction network based on GRU and transformer to map human motions to latent space. Next, we employ convex optimization to match the noise distribution with the latent space distribution of human motions through the Optimal Transport (OT) map. Then, we combine the extended OT map with the generator of reconstruction network to generate new human motions. Thereby overcoming the issues of mode collapse and mode mixture. MOOT generates a latent code distribution that is well-behaved and highly structured, providing a strong motion prior for various applications in the field of human motion. Through qualitative and quantitative experiments, MOOT achieves state-of-the-art results surpassing the latest methods, validating its superiority in unconditional human motion generation.

Towards Efficient and Diverse Generative Model for Unconditional Human Motion Synthesis

Neural Motion Graph.

Human Motion Diffusion Model

ViMo: Generating Motions from Casual Videos

Generative Model-Enhanced Human Motion Prediction

DiverseMotion: Towards Diverse Human Motion Generation Via Discrete Diffusion

Human Motion Generation: A Survey

Executing Your Commands Via Motion Diffusion in Latent Space.

Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models

Diverse Human Motion Prediction via Gumbel-Softmax Sampling from an Auxiliary Space

Human Motion Transfer With 3D Constraints and Detail Enhancement

Generating Continual Human Motion in Diverse 3D Scenes

Multi-Resolution Generative Modeling of Human Motion from Limited Data

StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework

MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators

Purposer: Putting Human Motion Generation in Context

Dynamic Future Net: Diversified Human Motion Generation

Searching Motion Graphs for Human Motion Synthesis.

REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer

Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance