Ivan DeAndres-Tame,Ruben Tolosana,Pietro Melzi,Ruben Vera-Rodriguez,Minchul Kim,Christian Rathgeb,Xiaoming Liu,Luis F. Gomez,Aythami Morales,Julian Fierrez,Javier Ortega-Garcia,Zhizhou Zhong,Yuge Huang,Yuxi Mi,Shouhong Ding,Shuigeng Zhou,Shuai He,Lingzhi Fu,Heng Cong,Rongyu Zhang,Zhihong Xiao,Evgeny Smirnov,Anton Pimenov,Aleksei Grigorev,Denis Timoshenko,Kaleb Mesfin Asfaw,Cheng Yaw Low,Hao Liu,Chuyi Wang,Qing Zuo,Zhixiang He,Hatef Otroshi Shahreza,Anjith George,Alexander Unnervik,Parsa Rahimi,Sébastien Marcel,Pedro C. Neto,Marco Huber,Jan Niklas Kolf,Naser Damer,Fadi Boutros,Jaime S. Cardoso,Ana F. Sequeira,Andrea Atzori,Gianni Fenu,Mirko Marras,Vitomir Štruc,Jiang Yu,Zhangjie Li,Jichun Li,Weisong Zhao,Zhen Lei,Xiangyu Zhu,Xiao-Yu Zhang,Bernardo Biesseck,Pedro Vidal,Luiz Coelho,Roger Granada,David Menotti

Abstract:Synthetic data is gaining increasing popularity for face recognition technologies, mainly due to the privacy concerns and challenges associated with obtaining real data, including diverse scenarios, quality, and demographic groups, among others. It also offers some advantages over real data, such as the large amount of data that can be generated or the ability to customize it to adapt to specific problem-solving needs. To effectively use such data, face recognition models should also be specifically designed to exploit synthetic data to its fullest potential. In order to promote the proposal of novel Generative AI methods and synthetic data, and investigate the application of synthetic data to better train face recognition systems, we introduce the 2nd FRCSyn-onGoing challenge, based on the 2nd Face Recognition Challenge in the Era of Synthetic Data (FRCSyn), originally launched at CVPR 2024. This is an ongoing challenge that provides researchers with an accessible platform to benchmark i) the proposal of novel Generative AI methods and synthetic data, and ii) novel face recognition systems that are specifically proposed to take advantage of synthetic data. We focus on exploring the use of synthetic data both individually and in combination with real data to solve current challenges in face recognition such as demographic bias, domain adaptation, and performance constraints in demanding situations, such as age disparities between training and testing, changes in the pose, or occlusions. Very interesting findings are obtained in this second edition, including a direct comparison with the first one, in which synthetic databases were restricted to DCFace and GANDiffFace.

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to solve several key challenges in face recognition technology, especially the problems when using synthetic data to train and improve face recognition systems. Specifically, by organizing the 2nd "Face Recognition Challenge Based on Synthetic Data (2ndFRCSyn - onGoing)", the paper explores the following two main problems: 1. **Demographic Bias Mitigation**: - The paper attempts to use synthetic data to alleviate the performance differences of face recognition systems among different demographic groups such as races and genders. Traditional real - data sets may lead to lower recognition rates for some groups due to uneven sample distribution, while synthetic data can generate diverse facial images, thus better representing different groups. 2. **Overall Performance Improvement**: - The paper also explores how to use synthetic data to improve the performance of face recognition systems under various complex conditions, such as age differences, pose changes, and occlusions. These challenges are very common in practical application scenarios, but traditional training methods are often difficult to deal with. To achieve these goals, the paper introduces a series of subtasks, allowing researchers to use different generation frameworks to create synthetic data and evaluate the impact of these data on face recognition systems. In addition, the paper also provides detailed experimental settings and evaluation metrics to ensure the comparability and reliability of the results. ### Key Research Questions 1. **What are the limitations of face recognition technology trained only with synthetic data?** - This question explores the performance of systems that rely entirely on synthetic data for training in practical applications and the gap compared to systems trained with real data. 2. **Can synthetic data help alleviate the limitations in current face recognition technology?** - By comparing the system performance under different conditions (such as using only synthetic data, combining real and synthetic data), the paper evaluates the potential of synthetic data in improving face recognition technology. The raising and answering of these questions are of great significance for promoting the development of face recognition technology. Especially in the context of increasingly strict privacy protection regulations, synthetic data has become an attractive alternative.

Second FRCSyn-onGoing: Winning Solutions and Post-Challenge Analysis to Improve Face Recognition with Synthetic Data

Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data

FRCSyn-onGoing: Benchmarking and comprehensive evaluation of real and synthetic data to improve face recognition systems

FRCSyn Challenge at WACV 2024:Face Recognition Challenge in the Era of Synthetic Data

SDFR: Synthetic Data for Face Recognition Competition

SynFace: Face Recognition with Synthetic Data

Face Recognition Using Synthetic Face Data

Synthetic Data for Face Recognition: Current State and Future Prospects

GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations

If It's Not Enough, Make It So: Reducing Authentic Data Demand in Face Recognition through Synthetic Faces

Synthetic Data for the Mitigation of Demographic Biases in Face Recognition

On the use of automatically generated synthetic image datasets for benchmarking face recognition

Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Identity-driven Three-Player Generative Adversarial Network for Synthetic-based Face Recognition

DCFace: Synthetic Face Generation with Dual Condition Diffusion Model

Synthetic Face Datasets Generation via Latent Space Exploration from Brownian Identity Diffusion

The Impact of Balancing Real and Synthetic Data on Accuracy and Fairness in Face Recognition

Domain-Specific Face Synthesis for Video Face Recognition from a Single Sample Per Person

Ethics-aware face recognition aided by synthetic face images