AgileAvatar: Stylized 3D Avatar Creation via Cascaded Domain Bridging

Shen Sang, Tiancheng Zhi, Guoxian Song, Minghao Liu, Chunpong Lai, Jing Liu, Xiang Wen, James Davis, Linjie Luo

DOI: https://doi.org/10.1145/3550469.3555402

2022-11-15

Abstract: Stylized 3D avatars have become increasingly prominent in our modern life. Creating these avatars manually usually involves laborious selection and adjustment of continuous and discrete parameters and is time-consuming for average users. Self-supervised approaches to automatically create 3D avatars from user selfies promise high quality with little annotation cost but fall short in application to stylized avatars due to a large style domain gap. We propose a novel self-supervised learning framework to create high-quality stylized 3D avatars with a mix of continuous and discrete parameters. Our cascaded domain bridging framework first leverages a modified portrait stylization approach to translate input selfies into stylized avatar renderings as the targets for desired 3D avatars. Next, we find the best parameters of the avatars to match the stylized avatar renderings through a differentiable imitator we train to mimic the avatar graphics engine. To ensure we can effectively optimize the discrete parameters, we adopt a cascaded relaxation-and-search pipeline. We use a human preference study to evaluate how well our method preserves user identity compared to previous work as well as manual creation. Our results achieve much higher preference scores than previous work and close to those of manual creation. We also provide an ablation study to justify the design choices in our pipeline.

Computer Vision and Pattern Recognition, Graphics

What problem does this paper attempt to address?

This paper addresses the automatic creation of high-quality personalized 3D avatars, specifically stylized 3D avatars, from user selfies. Creating such avatars manually requires tedious selection and adjustment across a large number of art assets, which is time-consuming and difficult for ordinary users. Existing self-supervised methods can automatically generate semi-realistic 3D avatars from selfies and preserve user identity well, but they perform poorly on stylized avatars because of the large style domain gap between selfies and stylized avatars. To overcome this, the authors propose a self-supervised learning framework that handles a mix of continuous and discrete parameters to create high-quality stylized 3D avatars. The framework narrows the style domain gap gradually in three stages: 1) portrait stylization, which translates the input selfie into a stylized avatar rendering; 2) self-supervised avatar parameterization, which finds the optimal avatar parameters by optimizing through a differentiable imitator trained to mimic the graphics engine; 3) avatar vector conversion, which converts parameters from the relaxed avatar vector space into the strict avatar vector space so that the graphics engine can use them directly. The paper also evaluates how well the method preserves user identity through a human preference study; the results show higher preference scores than previous methods and scores close to those of manual creation. The authors also provide an ablation study to justify the design choices in the pipeline.
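To make the distinction between the relaxed and the strict avatar vector space concrete, here is a minimal sketch in Python. It assumes that each discrete parameter is relaxed into a probability vector via softmax and that the strict form is a one-hot vector; the parameter names (`hair_type`, `glasses`), the logit values, and the argmax conversion rule are all hypothetical illustrations. In particular, argmax is only the simplest possible stand-in for the search step named in the abstract, not the paper's actual procedure.

```python
import math
from typing import Dict, List


def softmax(logits: List[float]) -> List[float]:
    """Relax raw scores into a probability vector (sums to 1)."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def to_strict(relaxed: Dict[str, List[float]]) -> Dict[str, List[int]]:
    """Convert each relaxed probability vector into a one-hot vector.

    Assumption: the most probable option is selected (argmax). This is a
    simplified stand-in for a search over candidate options.
    """
    strict: Dict[str, List[int]] = {}
    for name, probs in relaxed.items():
        best = max(range(len(probs)), key=probs.__getitem__)
        strict[name] = [1 if i == best else 0 for i in range(len(probs))]
    return strict


# Hypothetical discrete avatar parameters in the relaxed space.
relaxed = {
    "hair_type": softmax([0.2, 2.5, -1.0]),
    "glasses": softmax([1.0, 0.1]),
}

# Strict space: exactly one option is selected per parameter.
strict = to_strict(relaxed)
print(strict)
```

The relaxed form is what makes gradient-based optimization of discrete choices possible, while the strict form is what a graphics engine that expects exactly one asset per slot would accept.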

3Dtoonify: Creating Your High-Fidelity 3D Stylized Avatar Easily from 2D Portrait Images

SwiftAvatar: Efficient Auto-Creation of Parameterized Stylized Character on Arbitrary Avatar Engines

AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation

Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture

Fast 3D Stylized Gaussian Portrait Generation From a Single Image With Style Aligned Sampling Loss

Your3dEmoji: Creating Personalized Emojis Via One-shot 3D-Aware Cartoon Avatar Synthesis

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation

AniArtAvatar: Animatable 3D Art Avatar from a Single Image

X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation

PuzzleAvatar: Assembling 3D Avatars from Personal Albums

SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance

UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures

MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space

AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text

Avatar digitization from a single image for real-time rendering

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation

AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose

Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos

GenCA: A Text-conditioned Generative Model for Realistic and Drivable Codec Avatars