Gender bias in generative artificial intelligence text-to-image depiction of medical students

Geoffrey Currie,Josie Currie,Sam Anderson,Johnathan Hewis
DOI: https://doi.org/10.1177/00178969241274621
2024-08-30
Health Education Journal
Abstract:Health Education Journal, Ahead of Print. Introduction:In Australia, 54.3% of medical students are women yet they remain under-represented in stereotypical perspectives of medicine. While potentially transformative, generative artificial intelligence (genAI) has the potential for errors, misrepresentations and bias. GenAI text-to-image production could reinforce gender biases making it important to evaluate DALL-E 3 (the text-to-image genAI supported through ChatGPT) representations of Australian medical students.Method:In March 2024, DALL-E 3 was utilised via GPT-4 to generate a series of individual and group images of medical students, specifically Australian undergraduate medical students to eliminate potential confounders. Multiple iterations of images were generated using a variety of prompts. Collectively, 47 images were produced for evaluation of which 33 were individual characters and the remaining 14 images were comprised of multiple (5 to 67) characters. All images were independently analysed by three reviewers for apparent gender and skin tone. Consequently, 33 feature individuals were evaluated and a further 417 characters in groups were evaluated (N = 448). Discrepancies in responses were resolved by consensus.Results:Collectively (individual and group images), 58.8% (N = 258) of medical students were depicted as men, 39.9% (N = 175) as women, 92.0% (N = 404) with a light skin tone, 7.7% (N = 34) with mid skin tone and 0% with dark skin tone. The gender distribution was a statistically significant variation from that of actual Australian medical students for individual images, for group images and for collective images. Among the images of individual medical students (N = 25), DALL-E 3 generated 92% (N = 23) as men and 100% were of light skin tone (N = 25).Conclusion:This evaluation reveals the gender associated with genAI text-to-image generation using DALL-E 3 among Australian undergraduate medical students. Generated images included a disproportionately high proportion of white male medical students which is not representative of the diversity of medical students in Australia. The use of DALL-E 3 to produce depictions of medical students for education or promotion purposes should be done with caution.
public, environmental & occupational health,education & educational research
What problem does this paper attempt to address?