Abstract: We present TimeWalker, a novel framework that models realistic, full-scale 3D head avatars of a person on a lifelong scale. Unlike current human head avatar pipelines that capture identity at the momentary level (e.g., instant photography or short videos), TimeWalker constructs a person's comprehensive identity from unstructured data collected over various stages of their life, offering a paradigm for full reconstruction and animation of that person at different moments of life. At the heart of TimeWalker is a novel neural parametric model that learns a personalized representation with shape, expression, and appearance disentangled across ages. Our methodology rests on two components: (1) We return to the principle of modeling a person's identity as an additive combination of an average head representation in canonical space and moment-specific head attribute representations derived from a set of neural head bases. To learn a set of head bases that represents the full range of head variations compactly, we propose a Dynamic Neural Basis-Blending Module (Dynamo), which dynamically adjusts the number and blend weights of the neural head bases according to both the shared and the age-specific traits of the target person. (2) We introduce Dynamic 2D Gaussian Splatting (DNA-2DGS), an extension of the Gaussian splatting representation, to model head motion deformations such as facial expressions without losing rendering and reconstruction realism. DNA-2DGS comprises a set of controllable, oriented 2D planar Gaussian disks that use priors from a parametric model and move and rotate as the expression changes. Through extensive experimental evaluations, we show TimeWalker's ability to reconstruct and animate avatars across decoupled dimensions with realistic rendering effects, demonstrating a way to achieve personalized 'time traveling' with ease.
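
The additive identity model described in the abstract can be illustrated with a toy sketch. This is only one reading of the abstract, not the authors' implementation: the function name `blend_head`, the flat-vector representation of a head, and the example numbers are all assumptions made for illustration. It keeps the number of bases fixed, so it does not model how Dynamo adjusts the basis count, and it does not model DNA-2DGS.

```python
from typing import List, Sequence


def blend_head(mean_head: Sequence[float],
               bases: Sequence[Sequence[float]],
               weights: Sequence[float]) -> List[float]:
    """Hypothetical helper: return mean_head + sum_k weights[k] * bases[k].

    Heads are plain float vectors here, which is an assumption;
    the abstract does not say how the representations are stored.
    """
    if len(bases) != len(weights):
        raise ValueError("need exactly one weight per basis")
    out = list(mean_head)
    for w, basis in zip(weights, bases):
        if len(basis) != len(out):
            raise ValueError("basis and mean head must have the same length")
        for i, b in enumerate(basis):
            out[i] += w * b  # add the weighted contribution of this basis
    return out


# Toy data (made up): a 3-value "average head" and two head bases.
mean_head = [1.0, 2.0, 3.0]
bases = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0]]

# Two life stages of the same person differ only in their blend weights.
young = blend_head(mean_head, bases, [0.5, 0.0])
old = blend_head(mean_head, bases, [0.0, 1.0])
```

In this sketch the average head and the bases are shared across all life stages, and only the weight vector changes per moment, which mirrors the shared versus moment-specific split the abstract describes.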

ExpAvatar: High-Fidelity Avatar Generation of Unseen Expressions with 3D Face Priors

DynamicAvatars: Accurate Dynamic Facial Avatars Reconstruction and Precise Editing with Diffusion Models

FreeAvatar: Robust 3D Facial Animation Transfer by Learning an Expression Foundation Model

DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars

TimeWalker: Personalized Neural Space for Lifelong Head Avatars

GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image

AvatarMAV: Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels

Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos

High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field

GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar

GenCA: A Text-conditioned Generative Model for Realistic and Drivable Codec Avatars

GPAvatar: Generalizable and Precise Head Avatar from Image(s)

One2Avatar: Generative Implicit Head Avatar For Few-shot User Adaptation

HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors

Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation

Facial Expression Retargeting from Human to Avatar Made Easy

Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models

DEGAS: Detailed Expressions on Full-Body Gaussian Avatars

AnimateMe: 4D Facial Expressions via Diffusion Models

ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance