4D Agnostic Real-Time Facial Animation Pipeline for Desktop Scenarios

Wei Chen,HongWei Xu,Jelo Wang
2023-04-06
Abstract:We present a high-precision real-time facial animation pipeline suitable for animators to use on their desktops. This pipeline is about to be launched in FACEGOOD's Avatary\footnote{<a class="link-external link-https" href="https://www.avatary.com/" rel="external noopener nofollow">this https URL</a>} software, which will accelerate animators' productivity. The pipeline differs from professional head-mounted facial capture solutions in that it only requires the use of a consumer-grade 3D camera on the desk to achieve high-precision real-time facial capture. The system enables animators to create high-quality facial animations with ease and speed, while reducing the cost and complexity of traditional facial capture solutions. Our approach has the potential to revolutionize the way facial animation is done in the entertainment industry.
Graphics,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to develop a high - precision real - time facial animation pipeline suitable for use by animators in a desktop environment. Specifically, the paper aims to improve the existing facial capture technologies in the following aspects: 1. **Reduce cost and complexity**: Traditional facial capture solutions usually require professional head - mounted devices, which are costly and complex to operate. The method proposed in this paper can achieve high - precision real - time facial capture only by using consumer - level 3D cameras on the desktop, thus reducing hardware requirements and operational difficulty. 2. **Improve productivity**: By simplifying the facial capture process and improving the capture accuracy, this method can significantly enhance the work efficiency of animators, enabling them to create high - quality facial animations more easily and quickly. 3. **Achieve real - time and naturalness**: This method not only realizes the real - time generation of facial animations but also ensures the naturalness and expressiveness of the animations. By accurately tracking facial features (such as eyes, mouths, eyebrows, etc.) and calculating weights, the generated facial animations are more realistic. 4. **Wide application**: This technology can be applied in multiple fields such as movie, video game production, virtual reality experiences, and remote conferences, and has wide practical value. ### Technical details To achieve the above goals, the paper proposes a facial reconstruction process that includes three main steps: - **Fusion**: Generate a complete face point - cloud data by merging multiple face images taken from different angles. - **3DMM (3D Morphable Model)**: Use the 3DMM method to construct a more realistic and accurate 3D face model to capture the unique features of the human face. - **Non - rigid ICP (Non - rigid Iterative Closest Point)**: Optimize the 3D model to better fit the original image by adjusting the position and orientation of facial features. In addition, the paper also describes in detail how to further improve the quality and naturalness of facial animations through technical means such as calculating weights, filtering weights, and eye - tracking. ### Conclusion Through the above methods, the paper successfully realizes a real - time driving system based on 52 blendshapes, which can efficiently and accurately capture and reproduce users' facial expressions in a desktop environment. This achievement provides strong technical support for the popularization of real - time facial animations in multiple application fields.