LPHD: A Large-Scale Head Pose Dataset for RGB Images

Wei Sun, Yezhao Fan, Xiongkuo Min, Shihao Peng, Siwei Ma, Guangtao Zhai
DOI: https://doi.org/10.1109/ICME.2019.00190
2019-01-01
Abstract: Head pose estimation has attracted much research interest in recent years. With the advent of deep learning, it is possible to predict head pose accurately from RGB images without the help of facial landmarks or depth information. However, existing head pose datasets often lack head images with large poses, which severely limits the development of head pose estimation algorithms. In this paper, we build a large-scale head pose dataset (LHPD) that includes more than 140,000 images with diverse and accurate head poses. For the first time, the LHPD dataset includes head images recorded from different shooting angles between the camera and the human body, which greatly expands the range of head poses compared with previous datasets. As a result, the range of head poses covers ±90° for each Euler angle. Accurate and reliable head pose annotations are obtained with a motion capture system and careful calibration procedures. We then propose a head pose estimation method that fine-tunes ResNet on the LHPD dataset, using the Euclidean distance between quaternions as the loss function. The results show that our method achieves better performance than current state-of-the-art algorithms.
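The loss mentioned in the abstract, the Euclidean distance between quaternions, can be illustrated with a minimal NumPy sketch. The abstract does not specify the Euler angle convention, the quaternion component order, or whether predictions are normalized, so the ZYX (yaw, pitch, roll) convention, the (w, x, y, z) ordering, the normalization step, and the function names `euler_to_quaternion` and `quaternion_distance_loss` are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np

def euler_to_quaternion(yaw, pitch, roll):
    """Convert Euler angles in degrees to a unit quaternion (w, x, y, z).

    Assumes the ZYX convention (yaw about z, pitch about y, roll about x);
    the paper's actual convention is not stated in the abstract.
    """
    y, p, r = np.radians([yaw, pitch, roll]) / 2.0
    cy, sy = np.cos(y), np.sin(y)
    cp, sp = np.cos(p), np.sin(p)
    cr, sr = np.cos(r), np.sin(r)
    return np.array([
        cr * cp * cy + sr * sp * sy,  # w
        sr * cp * cy - cr * sp * sy,  # x
        cr * sp * cy + sr * cp * sy,  # y
        cr * cp * sy - sr * sp * cy,  # z
    ])

def quaternion_distance_loss(pred, target):
    """Euclidean distance between a predicted and a target quaternion.

    The prediction is normalized to unit length first (an assumption),
    since a raw network output is generally not a unit quaternion.
    """
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    pred = pred / np.linalg.norm(pred)
    return float(np.linalg.norm(pred - target))

# Hypothetical example: ground truth at 30 degrees yaw, prediction at 25 degrees.
target_q = euler_to_quaternion(30.0, 0.0, 0.0)
pred_q = euler_to_quaternion(25.0, 0.0, 0.0)
loss = quaternion_distance_loss(pred_q, target_q)
```

One motivation for a quaternion target over raw Euler angles is that it avoids the discontinuities and gimbal lock that Euler angles exhibit near ±90°, which matters for a dataset covering that full range. Note that q and -q represent the same rotation; this sketch does not handle that sign ambiguity.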