Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image Generative Models

Abhishek Mandal, Susan Leavy, Suzanne Little
2024-10-10
Abstract: Text-To-Image (TTI) Diffusion Models such as DALL-E and Stable Diffusion are capable of generating images from text prompts. However, they have been shown to perpetuate gender stereotypes. These models process data internally in multiple stages and employ several constituent models, often trained separately. In this paper, we propose two novel metrics to measure bias internally in these multistage multimodal models. Diffusion Bias was developed to detect and measure bias introduced by the diffusion stage of the models. Bias Amplification measures amplification of bias during the text-to-image conversion process. Our experiments reveal that TTI models amplify gender bias, that the diffusion process itself contributes to bias, and that Stable Diffusion v2 is more prone to gender bias than DALL-E 2.
Computer Vision and Pattern Recognition, Computers and Society
What problem does this paper attempt to address?
This paper addresses the possibility that text-to-image (TTI) generation models amplify gender bias during image generation. These models process data internally in multiple stages and use several component models, which are often trained separately. The paper points out that existing research mainly audits bias in the model as a whole and overlooks how these multistage, multimodal models handle bias internally. It therefore proposes two new metrics for measuring gender bias inside these models:

1. **Diffusion Bias (\(\delta\))**: detects and measures the bias introduced by the diffusion process.
2. **Bias Amplification (\(\alpha\))**: measures the amplification of bias during the text-to-image conversion process.

With these two metrics, the paper aims to answer the following research questions:

1. How can bias within TTI models be measured effectively?
2. Is bias amplified during the image generation process, and how does the model architecture affect this?

The contributions of the paper are:

1. Two new metrics, based on the Multimodal Composite Association Score (MCAS), for detecting and measuring gender bias in TTI diffusion models.
2. An analysis of the internal bias dynamics of TTI models and the role of their architecture in bias amplification.

Through these methods, the paper aims to help understand, detect, and reduce gender bias in these complex models.
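To give a rough sense of what an association-based bias score can look like, here is a minimal sketch of a WEAT-style differential association between one target embedding and two attribute sets. It is an illustration only: the function names `cosine` and `association`, the toy two-dimensional vectors, and the choice of a mean cosine-similarity difference are all assumptions, and the sketch does not reproduce the paper's definitions of MCAS, \(\delta\), or \(\alpha\).

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def association(target, attrs_a, attrs_b):
    """Hypothetical WEAT-style score: mean similarity of `target` to
    attribute set A minus its mean similarity to attribute set B.
    Positive values mean the target sits closer to A; zero means no
    preference between the two sets."""
    mean_a = sum(cosine(target, a) for a in attrs_a) / len(attrs_a)
    mean_b = sum(cosine(target, b) for b in attrs_b) / len(attrs_b)
    return mean_a - mean_b


# Toy embeddings (assumed values, not taken from any model).
target = [1.0, 0.0]      # e.g. an embedding for an occupation prompt
attrs_a = [[1.0, 0.0]]   # e.g. embeddings for one gender attribute set
attrs_b = [[0.0, 1.0]]   # e.g. embeddings for the other attribute set

score = association(target, attrs_a, attrs_b)
```

In a real audit the vectors would come from the model's text, image, or intermediate embeddings. Comparing such scores across stages is the kind of analysis the two proposed metrics are meant to support, though the exact formulas are given only in the paper.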