Abstract:Facial landmark detection aims at localizing multiple keypoints for a given facial image, which usually suffers from variations caused by arbitrary pose, diverse facial expression and partial occlusion. In this paper, we develop a two-stage regression network for facial landmark detection on unconstrained conditions. Our model consists of a Structural Hourglass Network (SHN) for detecting the initial locations of all facial landmarks based on heatmap generation, and a Global Constraint Network (GCN) for further refining the detected locations based on offset estimation. Specifically, SHN introduces an improved Inception-ResNet unit as basic building block, which can effectively improve the receptive field and learn contextual feature representations. In the meanwhile, a novel loss function with adaptive weight is proposed to make the whole model focus on the hard landmarks precisely. GCN attempts to explore the spatial contextual relationship between facial landmarks and refine the initial locations of facial landmarks by optimizing the global constraint. Moreover, we develop a pre-processing network to generate features with different scales, which will be transmitted to SHN and GCN for effective feature representations. Different from existing models, the proposed method realizes the heatmap-offset framework, which combines the outputs of heatmaps generated by SHN and coordinates estimated by GCN, to obtain an accurate prediction. The extensive experimental results on several challenging datasets, including 300W, COFW, AFLW, and 300-VW confirm that our method achieve competitive performance compared with the state-of-the-art algorithms.

Improved Cnn-Based Facial Landmarks Tracking Via Ridge Regression at 150 Fps on Mobile Devices

A Cross-Dimension Annotations Method for 3D Structural Facial Landmark Extraction

Real-Time Facial Landmark Detection by Attention-driven Lightweight Network

A Robust Facial Landmark Detector with Mixed Loss

Robust Facial Landmark Detection and Tracking Across Poses and Expressions for In-the-wild Monocular Video

Efficient conditioned face animation using frontally-viewed embedding

Towards Highly Accurate and Stable Face Alignment for High-Resolution Videos.

Facial Landmark Detection with Tweaked Convolutional Neural Networks

106-Point Facial Landmark Localization with Mobile Networks Based on Regression.

Robust Facial Landmark Detection Via Heatmap-Offset Regression

Towards efficient masked-face alignment via cascaded regression

Knowing When to Quit: Selective Cascaded Regression with Patch Attention for Real-Time Face Alignment

PFLD: A Practical Facial Landmark Detector.

Accurate Landmarking from 3D Facial Scans by CNN and Cascade Regression.

Fast Head Pose Estimation Via Rotation-Adaptive Facial Landmark Detection for Video Edge Computation.

A Failure-Aware Explicit Shape Regression Model for Facial Landmark Detection in Video

Facial Landmark Localization by Enhanced Convolutional Neural Network.

Towards Stabilizing Facial Landmark Detection and Tracking Via Hierarchical Filtering: A New Method

Stabilizing video facial landmark detection and tracking via global and local filtering.

Automatically Detecting Rigidly and Nonrigidly Deformed Facial Landmarks from Coarseness to Fineness.

FAST FACIAL LANDMARK DETECTION USING CASCADE CLASSIFIERS AND A SIMPLE 3D MODEL