VRMM: A Volumetric Relightable Morphable Head Model

Haotian Yang,Mingwu Zheng,Chongyang Ma,Yu-Kun Lai,Pengfei Wan,Haibin Huang
2024-02-06
Abstract:In this paper, we introduce the Volumetric Relightable Morphable Model (VRMM), a novel volumetric and parametric facial prior for 3D face modeling. While recent volumetric prior models offer improvements over traditional methods like 3D Morphable Models (3DMMs), they face challenges in model learning and personalized reconstructions. Our VRMM overcomes these by employing a novel training framework that efficiently disentangles and encodes latent spaces of identity, expression, and lighting into low-dimensional representations. This framework, designed with self-supervised learning, significantly reduces the constraints for training data, making it more feasible in practice. The learned VRMM offers relighting capabilities and encompasses a comprehensive range of expressions. We demonstrate the versatility and effectiveness of VRMM through various applications like avatar generation, facial reconstruction, and animation. Additionally, we address the common issue of overfitting in generative volumetric models with a novel prior-preserving personalization framework based on VRMM. Such an approach enables accurate 3D face reconstruction from even a single portrait input. Our experiments showcase the potential of VRMM to significantly enhance the field of 3D face modeling.
Graphics
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the poor performance of existing 3D facial modeling methods under dynamic expressions and changing lighting conditions. Specifically, although traditional 3D Morphable Models (3DMMs) can encode facial identity, expressions and other attributes, they have difficulties in achieving truly realistic details, especially when dealing with complex head components such as hair and the inside of the mouth. And although the existing volume - based models are more comprehensive in representing facial structures, they still have significant shortcomings in dynamic expression modeling and simulating the impact of different lighting conditions on the face. In addition, these models are prone to over - fitting problems in personalized reconstruction tasks, making it difficult to perform high - quality 3D facial reconstruction from a small number of input images. To this end, the paper proposes a brand - new Volume Relightable Morphable Model (VRMM), aiming to overcome the above challenges. VRMM efficiently decouples and encodes the low - dimensional representation spaces of identity, expression and lighting through the adoption of a self - supervised learning framework, thereby significantly reducing the constraints of training data and making it more practical in practice. In addition, the paper also proposes a VRMM - based prior - preserving personalization framework to solve the over - fitting problem in the generated volume model, enabling accurate 3D facial reconstruction even from a single portrait input. In summary, the main contributions of the paper are as follows: 1. Propose VRMM, which, as far as the authors know, is the first 3D volume facial prior model that can be continuously relighted and contains a complete range of expressions. 2. Design a new training framework for learning the decoupled parameter spaces of expressions, identity and lighting from multi - view image sequences captured under controllable lighting conditions. 3. Propose a novel personalization method, carefully designed to maintain the animatable and relightable characteristics provided by the prior, so as to be able to achieve high - fidelity avatar reconstruction from several or even a single image. 4. Extensive experiments show that VRMM performs excellently in various applications and outperforms previous methods.