Representation Bias of Adolescents in AI: A Bilingual, Bicultural Study

Robert Wolfe, Aayushi Dangol, Bill Howe, Alexis Hiniker
2024-08-04
Abstract: Popular and news media often portray teenagers with sensationalism, as both a risk to society and at risk from society. As AI begins to absorb some of the epistemic functions of traditional media, we study how teenagers in two countries speaking two languages 1) are depicted by AI, and 2) would prefer to be depicted. Specifically, we study the biases about teenagers learned by static word embeddings (SWEs) and generative language models (GLMs), comparing these with the perspectives of adolescents living in the U.S. and Nepal. We find English-language SWEs associate teenagers with societal problems, and more than 50% of the 1,000 words most associated with teenagers in the pretrained GloVe SWE reflect such problems. Given prompts about teenagers, 30% of outputs from GPT2-XL and 29% from LLaMA-2-7B GLMs discuss societal problems, most commonly violence, but also drug use, mental illness, and sexual taboo. Nepali models, while not free of such associations, are less dominated by social problems. Data from workshops with N=13 U.S. adolescents and N=18 Nepalese adolescents show that AI presentations are disconnected from teenage life, which revolves around activities like school and friendship. Participant ratings of how well 20 trait words describe teens are decorrelated from SWE associations, with Pearson's r=.02, n.s. in English FastText and r=.06, n.s. in GloVe; and r=.06, n.s. in Nepali FastText and r=-.23, n.s. in GloVe. U.S. participants suggested AI could fairly present teens by highlighting diversity, while Nepalese participants centered positivity. Participants were optimistic that, if it learned from adolescents, rather than media sources, AI could help mitigate stereotypes. Our work offers an understanding of the ways SWEs and GLMs misrepresent a developmentally vulnerable group and provides a template for less sensationalized characterization.
Computers and Society, Artificial Intelligence, Computation and Language, Human-Computer Interaction, Machine Learning
What problem does this paper attempt to address?
The paper explores how artificial intelligence (AI) systems represent adolescents and whether these representations reflect societal stereotypes and biases about teenagers. Specifically, the study focuses on the following aspects:

1. **Research Background**: Media often sensationalize teenagers, portraying them as both a risk to society and at risk from society. This portrayal influences adults' perceptions of teenagers and may affect related policy-making.
2. **Research Objectives**: The paper investigates how static word embeddings (SWEs) and generative language models (GLMs) learn and reflect societal attitudes and biases about teenagers.
3. **Research Methods**:
   - Analyze SWEs and GLMs in English and Nepali, comparing them with teenagers' self-perceptions.
   - Collect viewpoints from U.S. (N=13) and Nepalese (N=18) adolescents through workshops to understand how they perceive AI representations of teenagers and what they consider fair representation.
4. **Key Findings**:
   - English-language SWEs associate teenagers with societal problems: more than 50% of the 1,000 words most associated with teenagers in the pretrained GloVe SWE reflect such problems.
   - Given prompts about teenagers, 30% of outputs from GPT2-XL and 29% from LLaMA-2-7B discuss societal problems, most commonly violence, but also drug use, mental illness, and sexual taboo.
   - Nepali models show similar associations but are less dominated by them.
   - Workshop participants described teenage life as revolving around activities like school and friendship, which differs markedly from the models' representations. Their ratings of 20 trait words showed no significant correlation with SWE associations.
   - Participants named two core elements of fair representation of teenagers in AI: diversity (U.S. participants) and positivity (Nepalese participants).
5. **Conclusion**: The study indicates that current AI models have absorbed negative stereotypes about teenagers from the media. To represent teenagers more fairly, future work should draw on more diverse sources, including listening directly to teenagers so that models capture their self-understanding rather than relying solely on media portrayals.
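
The abstract reports Pearson's r between participant trait ratings and SWE associations. The sketch below shows one way to compute such a comparison. It is illustrative only: it assumes cosine similarity as the association measure (the abstract does not state the exact measure used), and the 3-dimensional vectors, trait words, and ratings in `toy_embeddings` and `toy_ratings` are invented for the example, not data from the study.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical toy vectors standing in for a pretrained SWE (e.g. GloVe or FastText).
toy_embeddings = {
    "teenager": [1.0, 0.0, 0.0],
    "rebellious": [0.9, 0.1, 0.0],
    "friendly": [0.1, 0.9, 0.0],
    "studious": [0.0, 0.2, 0.9],
    "violent": [0.8, 0.0, 0.3],
}

# Hypothetical mean participant ratings of how well each trait describes teens.
toy_ratings = {"rebellious": 2.0, "friendly": 4.5, "studious": 4.0, "violent": 1.0}

traits = sorted(toy_ratings)
# Association of each trait word with the target word in the embedding space.
associations = [cosine(toy_embeddings["teenager"], toy_embeddings[t]) for t in traits]
ratings = [toy_ratings[t] for t in traits]

r = pearson_r(associations, ratings)
print(f"Pearson's r between SWE associations and ratings: {r:.2f}")
```

With real embeddings, the toy dictionary would be replaced by vectors loaded from a pretrained model, and a significance test would accompany r, as in the n.s. values the abstract reports.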