Abstract:Regression-based face alignment involves learning a series of mapping functions to predict the true landmarks from an initial estimation of the alignment. Most existing approaches focus on learning efficacious mapping functions from some feature representations to improve performance. The issues related to the initial alignment estimation and the final learning objective, however, receive less attention. This work proposes a deep regression architecture with progressive reinitialization and a new error-driven learning loss function to explicitly address the above two issues. Given an image with a rough face detection result, the full face region is first mapped by a supervised spatial transformer network to a normalized form and trained to regress coarse positions of landmarks. Then, different face parts are further respectively reinitialized to their own normalized states, followed by another regression sub-network to refine the landmark positions. To deal with the inconsistent annotations in existing training datasets, we further propose an adaptive landmark-weighted loss function. It dynamically adjusts the importance of different landmarks according to their learning errors during training without depending on any hyper-parameters manually set by trial and error. A high level of robustness to annotation inconsistencies is thus achieved. The whole deep architecture permits training from end to end, and extensive experimental analyses and comparisons demonstrate its effectiveness and efficiency. The source code, trained models, and experimental results are made available at https://github.com/shaoxiaohu/Face_Alignment_DPR.git.

Dynamic Deformable Transformer for End‐to‐end Face Alignment

A Cross-Dimension Annotations Method for 3D Structural Facial Landmark Extraction

Cascade of Forests for Face Alignment

End-to-End Spatial Transform Face Detection and Recognition

SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

Robust Face Alignment via Deep Progressive Reinitialization and Adaptive Error-driven Learning

Direct Shape Regression Networks For End-To-End Face Alignment

Learning spatial-temporal deformable networks for unconstrained face alignment and tracking in videos

Quality-aware Face Alignment Using High-Resolution Spatial Dependencies

Dynamic Cascaded Regression Network with Reinforcement Learning for Robust Face Alignment

ADNet: Leveraging Error-Bias Towards Normal Direction in Face Alignment

CephalFormer: Incorporating Global Structure Constraint into Visual Features for General Cephalometric Landmark Detection

Towards Highly Accurate and Stable Face Alignment for High-Resolution Videos.

Efficient and Accurate Face Alignment by Global Regression and Cascaded Local Refinement.

Learning Deformable Hourglass Networks (Dhgn) For Unconstrained Face Alignment

Feedback Cascade Regression Model for Face Alignment

Salient-points-guided Face Alignment

Joint Face Detection and Alignment with a Deformable Hough Transform Model

Adaptive Weighted Face Alignment by Multi-Scale Feature and Offset Prediction

Face Alignment with Two-Layer Shape Regression

Face Alignment with Deep Regression.