SUPER: Selfie Undistortion and Head Pose Editing with Identity Preservation

Polina Karpikova,Andrei Spiridonov,Anna Vorontsova,Anastasia Yaschenko,Ekaterina Radionova,Igor Medvedev,Alexander Limonov
2024-06-18
Abstract:Self-portraits captured from a short distance might look unnatural or even unattractive due to heavy distortions making facial features malformed, and ill-placed head poses. In this paper, we propose SUPER, a novel method of eliminating distortions and adjusting head pose in a close-up face crop. We perform 3D GAN inversion for a facial image by optimizing camera parameters and face latent code, which gives a generated image. Besides, we estimate depth from the obtained latent code, create a depth-induced 3D mesh, and render it with updated camera parameters to obtain a warped portrait. Finally, we apply the visibility-based blending so that visible regions are reprojected, and occluded parts are restored with a generative model. Experiments on face undistortion benchmarks and on our self-collected Head Rotation dataset (HeRo), show that SUPER outperforms previous approaches both qualitatively and quantitatively, opening new possibilities for photorealistic selfie editing.
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The paper aims to address the issues of facial geometric distortion and improper head pose in selfie photos. Specifically, selfies taken at close range may suffer from perspective distortion, leading to deformed or asymmetrical facial features, such as an overly large nose, small or obscured ears, etc. Additionally, choosing an appropriate angle for selfies is challenging, making it necessary to adjust the head pose. To solve these problems, the paper proposes a new method called SUPER, which combines Generative Adversarial Networks (GAN) with 3D-based deformation techniques to achieve high-quality selfie editing while preserving identity features. SUPER can eliminate distortions and adjust head poses while ensuring the realism and detail of the image. Experimental results show that it significantly outperforms existing methods in facial distortion correction benchmarks.