How Good is ChatGPT at Face Biometrics? A First Look into Recognition, Soft Biometrics, and Explainability

Ivan DeAndres-Tame,Ruben Tolosana,Ruben Vera-Rodriguez,Aythami Morales,Julian Fierrez,Javier Ortega-Garcia
DOI: https://doi.org/10.1109/ACCESS.2024.3370437
2024-02-27
Abstract:Large Language Models (LLMs) such as GPT developed by OpenAI, have already shown astonishing results, introducing quick changes in our society. This has been intensified by the release of ChatGPT which allows anyone to interact in a simple conversational way with LLMs, without any experience in the field needed. As a result, ChatGPT has been rapidly applied to many different tasks such as code- and song-writer, education, virtual assistants, etc., showing impressive results for tasks for which it was not trained (zero-shot learning).
Computer Vision and Pattern Recognition,Artificial Intelligence,Computers and Society,Machine Learning
What problem does this paper attempt to address?
This paper discusses the application capability of ChatGPT in facial biometrics tasks. The study focuses on the performance of ChatGPT (based on the latest multimodal GPT-4 model) in face recognition, soft biometric feature estimation, and result interpretability. The authors evaluate the performance and robustness of ChatGPT through experiments, comparing it with the latest methods in the field using public benchmark datasets. The results show that ChatGPT has potential in facial biometrics, particularly in improving interpretability and transparency. For reproducibility, the researchers have released all the code. The paper also analyzes different configurations of ChatGPT, including image configuration and prompt configuration, to optimize its performance in facial biometrics tasks. Additionally, due to ChatGPT's lack of direct support for face recognition functionality, the researchers circumvent this limitation by adjusting the prompt information, enabling it to compare and explain facial images. Overall, the paper aims to explore the potential of large language models like ChatGPT in the field of facial recognition and emphasizes their importance in improving decision interpretability.