Abstract:Animatable clothing transfer, aiming at dressing and animating garments across characters, is a challenging problem. Most human avatar works entangle the representations of the human body and clothing together, which leads to difficulties for virtual try-on across identities. What's worse, the entangled representations usually fail to exactly track the sliding motion of garments. To overcome these limitations, we present Layered Gaussian Avatars (LayGA), a new representation that formulates body and clothing as two separate layers for photorealistic animatable clothing transfer from multi-view videos. Our representation is built upon the Gaussian map-based avatar for its excellent representation power of garment details. However, the Gaussian map produces unstructured 3D Gaussians distributed around the actual surface. The absence of a smooth explicit surface raises challenges in accurate garment tracking and collision handling between body and garments. Therefore, we propose two-stage training involving single-layer reconstruction and multi-layer fitting. In the single-layer reconstruction stage, we propose a series of geometric constraints to reconstruct smooth surfaces and simultaneously obtain the segmentation between body and clothing. Next, in the multi-layer fitting stage, we train two separate models to represent body and clothing and utilize the reconstructed clothing geometries as 3D supervision for more accurate garment tracking. Furthermore, we propose geometry and rendering layers for both high-quality geometric reconstruction and high-fidelity rendering. Overall, the proposed LayGA realizes photorealistic animations and virtual try-on, and outperforms other baseline methods. Our project page is <a class="link-external link-https" href="https://jsnln.github.io/layga/index.html" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the challenge of virtual try - on and clothing animation transfer between different characters. Specifically, most existing human avatar works entangle the representations of the human body and clothing, which leads to difficulties in cross - identity virtual try - on, and such entangled representations usually cannot accurately track the sliding movement of clothing. To overcome these limitations, the paper proposes Layered Gaussian Avatars (LayGA), a new representation method that treats the body and clothing as two independent layers to achieve photorealistic animatable clothing transfer. ### Main Contributions: 1. **Proposing Layered Gaussian Avatars (LayGA)**: This is the first layered human avatar representation based on 3D Gaussian Splatting (3DGS) for animatable clothing transfer. 2. **Geometric Constraints**: Geometric constraints are introduced to reconstruct smooth surfaces and support collision handling between the body and clothing in the layered representation. 3. **Multi - layer Learning**: In multi - layer learning, the previously segmented reconstruction is used as supervision to track clothing boundaries more accurately. In addition, a rendering layer is introduced to alleviate the problem of degraded rendering quality caused by geometric constraints. ### Method Overview: - **Single - layer Reconstruction Phase**: By introducing a series of geometric constraints, the 3D Gaussian distribution is forced onto the actual geometric surface, while obtaining the segmentation of the body and clothing. - **Multi - layer Fitting Phase**: Two independent models are trained to represent the body and clothing respectively. The previously reconstructed clothing geometry is used as 3D supervision to track clothing movement more accurately. - **Separating the Geometric Layer and the Rendering Layer**: To maintain the smoothness of the geometric layer and avoid the negative impact of geometric constraints on the rendering quality, two offsets are introduced for the geometric layer and the rendering layer respectively. - **Geometric Supervision**: The movement of clothing is directly supervised through the Chamfer distance loss. - **Segmentation Loss**: The segmentation loss is maintained to avoid the situation where clothing is optimized to be transparent. ### Application Scenarios: - **Virtual Try - on**: The clothing of one character can be transferred to another character, achieving cross - identity clothing transfer. - **Clothing Animation**: Generate realistic animations in new poses while handling collision problems. ### Summary: LayGA solves the limitations of existing methods in virtual try - on and clothing animation through layered modeling and geometric constraints, achieving more accurate clothing transfer and high - quality rendering effects.

LayGA: Layered Gaussian Avatars for Animatable Clothing Transfer

LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting

GGAvatar: Reconstructing Garment-Separated 3D Gaussian Splatting Avatars from Monocular Video

Modeling clothing as a separate layer for an animatable human avatar

Dressing Avatars: Deep Photorealistic Appearance for Physically Simulated Clothing

Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling

ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians

AniDress: Animatable Loose-Dressed Avatar from Sparse Views Using Garment Rigging Model

Motion Guided Deep Dynamic 3D Garments

PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

PBDyG: Position Based Dynamic Gaussians for Motion-Aware Clothed Human Avatars

Garment Avatars: Realistic Cloth Driving using Pattern Registration

Animatable and Relightable Gaussians for High-fidelity Human Avatar Modeling

GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians

Dynamic neural garments

BCNet: Learning Body and Cloth Shape from A Single Image

DiffAvatar: Simulation-Ready Garment Optimization with Differentiable Simulation

Drivable Avatar Clothing: Faithful Full-Body Telepresence with Dynamic Clothing Driven by Sparse RGB-D Input

Towards Multi-Layered 3D Garments Animation

Free-form Generation Enhances Challenging Clothed Human Modeling

Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping