Real-Time Facial Landmark Detection by Attention-driven Lightweight Network

Lei Li,Lifang Zhou
DOI: https://doi.org/10.1109/imcec51613.2021.9482202
2021-01-01
Abstract:Facial landmark detection is a basic challenge in computer vision. Currently, CNNs are the most effective methods for facial landmark detection. However, this works usually bring in a mass of model parameters, lead to high computing resource cost. To simultaneously consider the accuracy and compactness of model, we propose a new model, namely attention-driven lightweight face alignment network(ALFAN), using MobileNetV3 block as backbone network. The proposed ALFAN, with 6.5Mb of model size and fast processing speed, achieves satisfactory precision compared with mainstream approach. Moreover, in order to make the most of feature maps from different layers, the channel attention block and spatial attention block are used for the high-level feature maps and the low-level feature maps respectively. In addition, we design a geometric-wing loss to guide the network to learn more facial geometric/structural information. The evaluation on 300W challenging facial landmark dataset show that our algorithm performs better than some state-of-the-art methods.
What problem does this paper attempt to address?