Abstract:Emerging Metaverse applications demand accessible, accurate, and easy-to-use tools for 3D digital human creations in order to depict different cultures and societies as if in the physical world. Recent large-scale vision-language advances pave the way to for novices to conveniently customize 3D content. However, the generated CG-friendly assets still cannot represent the desired facial traits for human characteristics. In this paper, we present DreamFace, a progressive scheme to generate personalized 3D faces under text guidance. It enables layman users to naturally customize 3D facial assets that are compatible with CG pipelines, with desired shapes, textures, and fine-grained animation capabilities. From a text input to describe the facial traits, we first introduce a coarse-to-fine scheme to generate the neutral facial geometry with a unified topology. We employ a selection strategy in the CLIP embedding space, and subsequently optimize both the details displacements and normals using Score Distillation Sampling from generic Latent Diffusion Model. Then, for neutral appearance generation, we introduce a dual-path mechanism, which combines the generic LDM with a novel texture LDM to ensure both the diversity and textural specification in the UV space. We also employ a two-stage optimization to perform SDS in both the latent and image spaces to significantly provides compact priors for fine-grained synthesis. Our generated neutral assets naturally support blendshapes-based facial animations. We further improve the animation ability with personalized deformation characteristics by learning the universal expression prior using the cross-identity hypernetwork. Notably, DreamFace can generate of realistic 3D facial assets with physically-based rendering quality and rich animation ability from video footage, even for fashion icons or exotic characters in cartoons and fiction movies.

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion

DreamWaltz: Make a Scene with Complex 3D Animatable Avatars

DreamHuman: Animatable 3D Avatars from Text

Guide3D: Create 3D Avatars from Text and Image Guidance

UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures

SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance

AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text

Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation

Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models

GETAvatar: Generative Textured Meshes for Animatable Human Avatars

MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance

Articulated 3D Head Avatar Generation using Text-to-Image Diffusion Models

AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation

AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose

Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model

AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation

GenCA: A Text-conditioned Generative Model for Realistic and Drivable Codec Avatars