X-Avatar: Expressive Human Avatars

Kaiyue Shen,Chen Guo,Manuel Kaufmann,Juan Jose Zarate,Julien Valentin,Jie Song,Otmar Hilliges

2023-03-09

Abstract:We present X-Avatar, a novel avatar model that captures the full expressiveness of digital humans to bring about life-like experiences in telepresence, AR/VR and beyond. Our method models bodies, hands, facial expressions and appearance in a holistic fashion and can be learned from either full 3D scans or RGB-D data. To achieve this, we propose a part-aware learned forward skinning module that can be driven by the parameter space of SMPL-X, allowing for expressive animation of X-Avatars. To efficiently learn the neural shape and deformation fields, we propose novel part-aware sampling and initialization strategies. This leads to higher fidelity results, especially for smaller body parts while maintaining efficient training despite increased number of articulated bones. To capture the appearance of the avatar with high-frequency details, we extend the geometry and deformation fields with a texture network that is conditioned on pose, facial expression, geometry and the normals of the deformed surface. We show experimentally that our method outperforms strong baselines in both data domains both quantitatively and qualitatively on the animation task. To facilitate future research on expressive avatars we contribute a new dataset, called X-Humans, containing 233 sequences of high-quality textured scans from 20 participants, totalling 35,500 data frames.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper aims to address the problem of creating high-fidelity digital human avatars capable of capturing body posture, gestures, facial expressions, and appearance details. Specifically, the paper proposes the X-Avatar model, an animatable implicit human avatar model that can capture the full expression of the human body. This model can capture body posture, hand posture, facial expressions, and appearance within a unified framework and can be created from 3D scans or RGB-D images. X-Avatar improves training efficiency and result quality by introducing part-aware initialization and sampling strategies, and it has been validated on multiple datasets, demonstrating its superior performance in animation tasks. Additionally, the authors contribute a new dataset named X-Humans, which includes high-quality texture scans of 20 participants in different postures and expressions, totaling 35,500 frames.

X-Avatar: Expressive Human Avatars

Expressive Whole-Body 3D Gaussian Avatar

XAGen: 3D Expressive Human Avatars Generation

AvatarReX: Real-time Expressive Full-body Avatars

Expressive Gaussian Human Avatars from Monocular RGB Video

I M Avatar: Implicit Morphable Head Avatars from Videos

HAHA: Highly Articulated Gaussian Human Avatars with Textured Mesh Prior

TexVocab: Texture Vocabulary-conditioned Human Avatars

X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation

GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar

Deformable 3D Gaussian Splatting for Animatable Human Avatars

DEGAS: Detailed Expressions on Full-Body Gaussian Avatars

GETAvatar: Generative Textured Meshes for Animatable Human Avatars

NECA: Neural Customizable Human Avatar

XHand: Real-time Expressive Hand Avatar

HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos

AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text

FreeAvatar: Robust 3D Facial Animation Transfer by Learning an Expression Foundation Model

AniArtAvatar: Animatable 3D Art Avatar from a Single Image

HQ-Avatar: Towards High-Quality 3D Avatar Generation Via Point-based Representation

Dressing Avatars: Deep Photorealistic Appearance for Physically Simulated Clothing