Abstract:Facial landmark detection, head pose estimation, and facial deformation analysis are typical facial behavior analysis tasks in computer vision. The existing methods usually perform each task independently and sequentially, ignoring their interactions. To tackle this problem, we propose a unified framework for simultaneous facial landmark detection, head pose estimation, and facial deformation analysis, and the proposed model is robust to facial occlusion. Following a cascade procedure augmented with model-based head pose estimation, we iteratively update the facial landmark locations, facial occlusion, head pose and facial de- formation until convergence. The experimental results on benchmark databases demonstrate the effectiveness of the proposed method for simultaneous facial landmark detection, head pose and facial deformation estimation, even if the images are under facial occlusion.

What problem does this paper attempt to address?

The problems that this paper attempts to solve are several key tasks in facial behavior analysis - facial landmark detection, head pose estimation, and facial deformation analysis. These tasks become more complex and challenging when the face is occluded. Existing methods usually perform each task independently and sequentially, ignoring the interactions among them. To solve this problem, the authors propose a unified framework that can perform facial landmark detection, head pose estimation, and facial deformation analysis simultaneously, and the model is robust to facial occlusion. Specifically, the goals of the paper include: 1. **Unified framework**: Develop a method that can handle facial landmark detection, head pose estimation, and facial deformation analysis simultaneously, and utilize the joint relationships among these tasks to improve the performance of all tasks. 2. **Robustness**: Ensure that the proposed method is still effective when the face is partially or completely occluded. 3. **Iterative update**: Through a cascading process, combined with model - based head pose estimation, iteratively update the facial landmark positions, facial occlusion, head pose, and facial deformation until convergence. 4. **Experimental verification**: Prove the effectiveness of the proposed method through experimental results on benchmark databases, especially in the case of facial occlusion. The main contributions of the paper are: - Propose an iterative cascading method that can perform facial landmark detection, pose, and deformation estimation simultaneously, which is different from most existing methods that are processed independently or sequentially. - Systematically integrate learning - based facial landmark detection with model - based head pose and facial deformation estimation without 3D annotations. - Explicitly estimate facial occlusion, which is helpful for landmark detection, pose, and deformation estimation in the case of facial occlusion. - Experimental results show the effectiveness of the proposed method for facial landmark detection, pose estimation, and deformation estimation in the case of facial occlusion. In conclusion, this paper aims to solve the comprehensive challenges of multiple tasks in facial behavior analysis under facial occlusion conditions through a unified and robust framework.

Simultaneous Facial Landmark Detection, Pose and Deformation Estimation under Facial Occlusion

A Cross-Dimension Annotations Method for 3D Structural Facial Landmark Extraction

Robust Three-step Facial Landmark Localization under the Complicated Condition via ASM and POEM.

Face recognition with contiguous occlusion using linear regression and level set method

Face Sketch Landmarks Localization in the Wild

A Real-Time Multi-Task Learning System for Joint Detection of Face, Facial Landmark and Head Pose

Joint Multi-View Face Alignment in the Wild

Automatic facial expression recognition on a single 3D face by exploring shape deformation.

Joint Head Pose and Facial Landmark Regression from Depth Images

Simultaneous Facial Feature Tracking and Facial Expression Recognition.

Constrained Joint Cascade Regression Framework for Simultaneous Facial Action Unit Recognition and Facial Landmark Detection

Combining Data-driven and Model-driven Methods for Robust Facial Landmark Detection

Facial Image Deformation Based on Landmark Detection

FAST FACIAL LANDMARK DETECTION USING CASCADE CLASSIFIERS AND A SIMPLE 3D MODEL

Landmarks-assisted Collaborative Deep Framework for Automatic 4D Facial Expression Recognition.

Facial Landmark Detection: a Literature Survey

MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation

Deep Structured Prediction for Facial Landmark Detection

Robust and Precise Facial Landmark Detection by Self-Calibrated Pose Attention Network

Simultaneous face detection and 360 degree headpose estimation

3-D Facial Landmarks Detection for Intelligent Video Systems