PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing

Siddharth Seth,Rishabh Dabral,Diogo Luvizon,Marc Habermann,Ming-Hsuan Yang,Christian Theobalt,Adam Kortylewski

2024-11-07

Abstract:Modeling a human avatar that can plausibly deform to articulations is an active area of research. We present PocoLoco -- the first template-free, point-based, pose-conditioned generative model for 3D humans in loose clothing. We motivate our work by noting that most methods require a parametric model of the human body to ground pose-dependent deformations. Consequently, they are restricted to modeling clothing that is topologically similar to the naked body and do not extend well to loose clothing. The few methods that attempt to model loose clothing typically require either canonicalization or a UV-parameterization and need to address the challenging problem of explicitly estimating correspondences for the deforming clothes. In this work, we formulate avatar clothing deformation as a conditional point-cloud generation task within the denoising diffusion framework. Crucially, our framework operates directly on unordered point clouds, eliminating the need for a parametric model or a clothing template. This also enables a variety of practical applications, such as point-cloud completion and pose-based editing -- important features for virtual human animation. As current datasets for human avatars in loose clothing are far too small for training diffusion models, we release a dataset of two subjects performing various poses in loose clothing with a total of 75K point clouds. By contributing towards tackling the challenging task of effectively modeling loose clothing and expanding the available data for training these models, we aim to set the stage for further innovation in digital humans. The source code is available at <a class="link-external link-https" href="https://github.com/sidsunny/pocoloco" rel="external noopener nofollow">this https URL</a> .

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The problem that this paper attempts to solve is that in human body modeling with loose clothing, existing methods usually rely on parametric human body models or clothing templates, which limits their performance when dealing with loose clothing whose topological structure is quite different from the naked body shape. Specifically, when simulating loose clothing, existing methods often produce artifacts such as surface tearing or wrinkling, and require template fitting for input scans, which is a cumbersome process and may not meet the needs of artists. To address these problems, the paper proposes PocoLoco - the first template - free, point - cloud - based conditional generation model for 3D human body modeling with loose clothing. PocoLoco operates directly on unordered point clouds without assuming point correspondences, thus avoiding the need to use parametric models or clothing templates. This method can not only simulate the deformation of loose clothing more naturally, but also supports functions such as point cloud completion and pose - based editing, which are very important for the creation of virtual human animations. In addition, since the current scale of human body data sets with loose clothing is too small to train diffusion models, the author also releases a high - quality 3D reconstruction data set containing two subjects wearing loose clothing in different postures, with a total of 75,000 point clouds. By contributing more data, the author aims to lay the foundation for future innovations in the field of digital humans.

PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing

CLOTH3D: Clothed 3D Humans

Free-form Generation Enhances Challenging Clothed Human Modeling

Neural Point-based Shape Modeling of Humans in Challenging Clothing

The Power of Points for Modeling Humans in Clothing

DLCA-Recon: Dynamic Loose Clothing Avatar Reconstruction from Monocular Videos

AniDress: Animatable Loose-Dressed Avatar from Sparse Views Using Garment Rigging Model

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition

Dressing Avatars: Deep Photorealistic Appearance for Physically Simulated Clothing

3D Body Shapes Estimation from Dressed-Human Silhouettes.

AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion

ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild

Structured Local Radiance Fields for Human Avatar Modeling

MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images

Learning to Dress 3D People in Generative Clothing

SMPLicit: Topology-aware Generative Model for Clothed People

Dynamic Appearance Modeling of Clothed 3D Human Avatars using a Single Camera

SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes

PBDyG: Position Based Dynamic Gaussians for Motion-Aware Clothed Human Avatars

ESTIMATING HUMAN SHAPE UNDER CLOTHING FROM SINGLE FRONTAL VIEW POINT CLOUD OF A DRESSED HUMAN

HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model