Deep learning and face recognition: the state of the art

Stephen Balaban
DOI: https://doi.org/10.1117/12.2181526
2019-02-10
Abstract:Deep Neural Networks (DNNs) have established themselves as a dominant technique in machine learning. DNNs have been top performers on a wide variety of tasks including image classification, speech recognition, and face recognition. Convolutional neural networks (CNNs) have been used in nearly all of the top performing methods on the Labeled Faces in the Wild (LFW) dataset. In this talk and accompanying paper, I attempt to provide a review and summary of the deep learning techniques used in the state-of-the-art. In addition, I highlight the need for both larger and more challenging public datasets to benchmark these systems. The high accuracy (99.63% for FaceNet at the time of publishing) and utilization of outside data (hundreds of millions of images in the case of Google's FaceNet) suggest that current face verification benchmarks such as LFW may not be challenging enough, nor provide enough data, for current techniques. There exist a variety of organizations with mobile photo sharing applications that would be capable of releasing a very large scale and highly diverse dataset of facial images captured on mobile devices. Such an "ImageNet for Face Recognition" would likely receive a warm welcome from researchers and practitioners alike.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper aims to explore and summarize the latest progress of deep - learning techniques in the field of face recognition, and to raise some problems existing in current face - recognition research and future development directions. Specifically: 1. **Review of deep - learning techniques**: The paper reviews the application and development of deep - learning techniques, especially convolutional neural networks (CNNs) in face recognition. These techniques have significantly improved the accuracy of face - recognition systems, and even outperformed humans in some cases. 2. **Evaluation of existing datasets**: The paper points out that existing public datasets such as LFW (Labeled Faces in the Wild) may no longer be challenging enough for the current technological level because their data volume is relatively small and their diversity is insufficient. This limits researchers' ability to effectively evaluate face - recognition performance in more complex scenarios. 3. **Proposing new dataset requirements**: The author emphasizes the need for larger - scale and more diverse public datasets to promote the development of face - recognition techniques. Such datasets can better reflect the complexity and diversity in practical application scenarios, thus promoting the research and development of more robust face - recognition algorithms. 4. **Technical bottlenecks and future directions**: Although deep learning has achieved remarkable results in face recognition, the paper also points out some limitations of the current technology, such as the need for large - scale data and the dependence on specific alignment techniques. The author suggests that future research should focus on how to reduce the dependence on large - scale data and develop more general feature - learning methods. In conclusion, this paper not only summarizes the latest progress of current deep learning in the field of face recognition, but also points out the direction of future research, especially in terms of dataset construction and algorithm optimization.