Abstract: This study analyzed images generated by three popular generative artificial intelligence (AI) tools - Midjourney, Stable Diffusion, and DALL·E 2 - representing various occupations to investigate potential bias in AI generators. Our analysis revealed two overarching areas of concern in these AI generators, including (1) systematic gender and racial biases, and (2) subtle biases in facial expressions and appearances. Firstly, we found that all three AI generators exhibited bias against women and African Americans. Moreover, we found that the evident gender and racial biases uncovered in our analysis were even more pronounced than the status quo when compared to labor force statistics or Google images, intensifying the harmful biases we are actively striving to rectify in our society. Secondly, our study uncovered more nuanced prejudices in the portrayal of emotions and appearances. For example, women were depicted as younger with more smiles and happiness, while men were depicted as older with more neutral expressions and anger, posing a risk that generative AI models may unintentionally depict women as more submissive and less competent than men. Such nuanced biases, by their less overt nature, might be more problematic as they can permeate perceptions unconsciously and may be more difficult to rectify. Although the extent of bias varied depending on the model, the direction of bias remained consistent in both commercial and open-source AI generators. As these tools become commonplace, our study highlights the urgency to identify and mitigate various biases in generative AI, reinforcing the commitment to ensuring that AI technologies benefit all of humanity for a more inclusive future.
What problem does this paper attempt to address?
This paper addresses bias in the images produced by generative artificial intelligence (AI) tools. Specifically, the authors analyzed images generated by three popular tools (Midjourney, Stable Diffusion, and DALL·E 2) to examine gender and racial bias in how these tools depict different occupations, as well as subtler biases in facial expressions and appearance.
### Main research questions:
1. **Systematic gender and racial biases**:
- The study found that all three AI generators exhibit bias against women and African Americans.
- In some cases, these biases are even more pronounced than the status quo reflected in labor force statistics or Google image search results, intensifying the harmful biases that society is trying to correct.
2. **Subtle biases in facial expressions and appearances**:
- Women are usually depicted as younger, smiling more, and happier, while men are depicted as older, with more neutral or angry expressions.
- This subtle bias may inadvertently portray women as more submissive and less competent than men. Because it is less overt, it can shape perceptions unconsciously and be harder to correct.
### Research background:
- **Applications of generative AI**: Generative AI plays an important role in fields such as business and education, and can generate high-quality text, code, images, and video.
- **Potential risks**: Although generative AI has many benefits, there are also some potential risks, including intellectual property issues, output accuracy, result interpretability and the possible spread of harmful biases.
### Research methods:
- **Data generation**: Use Midjourney, Stable Diffusion and DALL·E 2 to generate about 8,000 images of different occupations.
- **Data analysis**: Detect facial features in the images, including gender, smile, emotion and age, through the Face++ API, and evaluate the gender and racial distributions.
- **Benchmark comparison**: Compare the generated images against U.S. Bureau of Labor Statistics (BLS) labor force data and Google image search results to evaluate the degree of bias.
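The analysis and benchmark steps above can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: it assumes face attributes have already been detected and stored as simple records, and the record fields (`generator`, `gender`), the function names, the toy generators, and the 50% benchmark are all hypothetical placeholders rather than the Face++ response format or real BLS figures.

```python
from collections import Counter, defaultdict


def attribute_shares(records, attribute):
    """Return {generator: {value: share}} for one detected attribute."""
    counts = defaultdict(Counter)
    for rec in records:
        counts[rec["generator"]][rec[attribute]] += 1
    shares = {}
    for generator, counter in counts.items():
        total = sum(counter.values())
        shares[generator] = {value: n / total for value, n in counter.items()}
    return shares


def gap_vs_benchmark(shares, value, benchmark_share):
    """Percentage-point gap between each generator's share and a benchmark."""
    return {
        generator: round(100 * (dist.get(value, 0.0) - benchmark_share), 1)
        for generator, dist in shares.items()
    }


# Toy records for illustration only; these are NOT data from the study.
records = [
    {"generator": "toy_generator_a", "gender": "Male"},
    {"generator": "toy_generator_a", "gender": "Male"},
    {"generator": "toy_generator_a", "gender": "Male"},
    {"generator": "toy_generator_a", "gender": "Female"},
    {"generator": "toy_generator_b", "gender": "Male"},
    {"generator": "toy_generator_b", "gender": "Male"},
    {"generator": "toy_generator_b", "gender": "Female"},
    {"generator": "toy_generator_b", "gender": "Female"},
]

shares = attribute_shares(records, "gender")
# Hypothetical benchmark: 50% women in the reference population.
gaps = gap_vs_benchmark(shares, "Female", benchmark_share=0.5)
print(gaps)  # {'toy_generator_a': -25.0, 'toy_generator_b': 0.0}
```

A negative gap means the group appears less often in the generated images than in the benchmark. In practice the benchmark would presumably be set per occupation rather than as a single number.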
### Research results:
1. **Gender bias**:
- In the images generated by all three AI generators, women are significantly underrepresented relative to men.
- For example, in the images generated by Midjourney, women account for only 23%, while men account for 77%.
2. **Racial bias**:
- Black individuals are significantly less represented in the generated images than White individuals.
- For example, in the images generated by Midjourney, Black individuals account for only 9%, while White individuals account for more than 50%.
3. **Facial expression and appearance bias**:
- Women are usually depicted as younger, smiling more, and happier, while men are depicted as older, with more neutral or angry expressions.
- This subtle bias may reinforce harmful gender stereotypes.
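To put the Midjourney gender split reported above (23% women, 77% men) in more concrete terms, the short calculation below converts it into a men-per-woman ratio and a gap from equal representation. The 50% parity baseline is an assumption made here for illustration; the study itself benchmarks against BLS data and Google image search, and the function name is hypothetical.

```python
def representation_summary(share_women_pct):
    """Summarize a two-group split given the percentage of women."""
    share_men_pct = 100 - share_women_pct
    return {
        # How many men are depicted for every woman depicted.
        "men_per_woman": round(share_men_pct / share_women_pct, 2),
        # Percentage-point distance from a 50/50 split (assumed baseline).
        "gap_from_parity_pp": share_women_pct - 50,
    }


summary = representation_summary(23)
print(summary)  # {'men_per_woman': 3.35, 'gap_from_parity_pp': -27}
```

Read this way, the reported split corresponds to roughly three to four men depicted for every woman.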
### Conclusions:
- **Urgency**: As generative AI tools become widely used across fields, identifying and correcting these biases is increasingly urgent.
- **Ethical and technical challenges**: It is necessary to find a balance between technological development and ethical commitment to ensure that generative AI systems are not only technologically advanced, but also inclusive and fair.
Through this study, the authors emphasize the importance of ensuring fairness and inclusiveness in the design and deployment of generative AI systems to avoid further exacerbating social biases and stereotypes.