SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes

Georgia Baltsou,Ioannis Sarridis,Christos Koutlis,Symeon Papadopoulos
2024-04-29
Abstract:AI systems rely on extensive training on large datasets to address various tasks. However, image-based systems, particularly those used for demographic attribute prediction, face significant challenges. Many current face image datasets primarily focus on demographic factors such as age, gender, and skin tone, overlooking other crucial facial attributes like hairstyle and accessories. This narrow focus limits the diversity of the data and consequently the robustness of AI systems trained on them. This work aims to address this limitation by proposing a methodology for generating synthetic face image datasets that capture a broader spectrum of facial diversity. Specifically, our approach integrates a systematic prompt formulation strategy, encompassing not only demographics and biometrics but also non-permanent traits like make-up, hairstyle, and accessories. These prompts guide a state-of-the-art text-to-image model in generating a comprehensive dataset of high-quality realistic images and can be used as an evaluation set in face analysis systems. Compared to existing datasets, our proposed dataset proves equally or more challenging in image classification tasks while being much smaller in size.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem this paper attempts to address is the lack of diversity in existing facial image datasets, particularly the neglect of non-permanent features such as hairstyles, makeup, accessories, etc. Most existing facial image datasets primarily focus on demographic factors like age, gender, and skin color, while ignoring other important facial attributes. This limits the diversity of the data and the robustness of AI systems trained on these datasets. To tackle this challenge, the paper proposes a method for generating a synthetic facial image dataset aimed at covering a broader range of facial diversity. Specifically, the method integrates a systematic prompting strategy that includes not only demographic and biometric features but also non-permanent features such as makeup, hairstyles, and accessories. These prompts guide state-of-the-art text-to-image models to generate a comprehensive dataset containing high-quality realistic images, which can be used for the evaluation of facial analysis systems. Compared to existing datasets, this dataset demonstrates equal or higher challenge in image classification tasks while being smaller in size. The dataset generated through this method—SDFD (Synthetic Diverse Face Dataset)—contains 1,000 different facial images, showcasing people of different races, genders, and ages, wearing various accessories, different types of makeup, and expressing various emotions. Despite its relatively small size, this dataset captures a wide range of different attributes, making it a challenging test set.