Abstract:Face reenactment methods attempt to restore and re-animate portrait videos as realistically as possible. Existing methods face a dilemma in quality versus controllability: 2D GAN-based methods achieve higher image quality but suffer in fine-grained control of facial attributes compared with 3D counterparts. In this work, we propose StyleAvatar, a real-time photo-realistic portrait avatar reconstruction method using StyleGAN-based networks, which can generate high-fidelity portrait avatars with faithful expression control. We expand the capabilities of StyleGAN by introducing a compositional representation and a sliding window augmentation method, which enable faster convergence and improve translation generalization. Specifically, we divide the portrait scenes into three parts for adaptive adjustments: facial region, non-facial foreground region, and the background. Besides, our network leverages the best of UNet, StyleGAN and time coding for video learning, which enables high-quality video generation. Furthermore, a sliding window augmentation method together with a pre-training strategy are proposed to improve translation generalization and training performance, respectively. The proposed network can converge within two hours while ensuring high image quality and a forward rendering time of only 20 milliseconds. Furthermore, we propose a real-time live system, which further pushes research into applications. Results and experiments demonstrate the superiority of our method in terms of image quality, full portrait video generation, and real-time re-animation compared to existing facial reenactment methods. Training and inference code for this paper are at <a class="link-external link-https" href="https://github.com/LizhenWangT/StyleAvatar" rel="external noopener nofollow">this https URL</a>.

D2Animator: Dual Distillation of StyleGAN for High-Resolution Face Animation

Realistic Face Reenactment Via Self-Supervised Disentangling of Identity and Pose

Style Fader Generative Adversarial Networks for Style Degree Controllable Artistic Style Transfer

StyleGAN2 Distillation for Feed-Forward Image Manipulation

StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation

Deformable One-shot Face Stylization via DINO Semantic Guidance

Video2StyleGAN: Disentangling Local and Global Variations in a Video

D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods

BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation

AgileGAN3D: Few-Shot 3D Portrait Stylization by Augmented Transfer Learning

AniFaceDiff: Animating Stylized Avatars via Parametric Conditioned Diffusion Models

HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks

One-Shot Face Video Re-enactment using Hybrid Latent Spaces of StyleGAN2

Multi-Modal Face Stylization with a Generative Prior

Your3dEmoji: Creating Personalized Emojis Via One-shot 3D-Aware Cartoon Avatar Synthesis.

PuppeteerGAN: Arbitrary Portrait Animation With Semantic-Aware Appearance Transformation

High-Fidelity Face Swapping with Style Blending

Face Animation with an Attribute-Guided Diffusion Model

StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video