Bias of AI-generated content: an examination of news produced by large language models

Xiao Fang, Shangkun Che, Minjia Mao, Hongzhe Zhang, Ming Zhao, Xiaohang Zhao
DOI: https://doi.org/10.1038/s41598-024-55686-2
IF: 4.6
2024-03-04
Scientific Reports
Abstract: Large language models (LLMs) have the potential to transform our lives and work through the content they generate, known as AI-Generated Content (AIGC). To harness this transformation, we need to understand the limitations of LLMs. Here, we investigate the bias of AIGC produced by seven representative LLMs, including ChatGPT and LLaMA. We collect news articles from The New York Times and Reuters, both known for their dedication to providing unbiased news. We then apply each examined LLM to generate news content with headlines of these news articles as prompts, and evaluate the gender and racial biases of the AIGC produced by the LLM by comparing the AIGC and the original news articles. We further analyze the gender bias of each LLM under biased prompts by adding gender-biased messages to prompts constructed from these news headlines. Our study reveals that the AIGC produced by each examined LLM demonstrates substantial gender and racial biases. Moreover, the AIGC generated by each LLM exhibits notable discrimination against females and individuals of the Black race. Among the LLMs, the AIGC generated by ChatGPT demonstrates the lowest level of bias, and ChatGPT is the sole model capable of declining content generation when provided with biased prompts.
multidisciplinary sciences
What problem does this paper attempt to address?
This paper examines bias in content generated by large language models (LLMs), specifically gender bias and racial bias. The study analyzes seven representative LLMs, including ChatGPT and LLaMA. Each model is prompted with headlines of news articles from The New York Times and Reuters, and the generated news content is compared with the original articles to assess bias at the lexical, sentence, and document levels. The main findings are as follows:

- At the lexical level, all tested LLMs exhibited substantial gender and racial bias. Among them, ChatGPT showed the lowest level of bias.
- At the sentence level, the study evaluated the sentiment expressed towards different groups and found that most of the generated content exhibited negative sentiment bias towards women and the Black race.
- Under biased prompts, which add gender-biased messages to the headline prompts, ChatGPT was the only model able to decline to generate content.

In summary, the paper shows that current large language models still have serious gender and racial bias in the content they generate, and it emphasizes the importance of improving models to reduce these biases.
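To make the idea of a lexical-level comparison concrete, here is a minimal sketch in Python. It is not the paper's metric: it assumes a simple word-list count and measures how the share of female-specific words among all gendered words shifts between an original article and a generated one. The word lists and the function names `female_share` and `share_shift` are hypothetical and chosen only for illustration.

```python
import re
from collections import Counter
from typing import Optional

# Hypothetical, deliberately tiny word lists (an assumption for this sketch);
# a real study would rely on a vetted lexicon.
FEMALE_WORDS = {"she", "her", "hers", "woman", "women", "female", "mother", "daughter"}
MALE_WORDS = {"he", "him", "his", "man", "men", "male", "father", "son"}


def female_share(text: str) -> Optional[float]:
    """Fraction of gendered words that are female-specific.

    Returns None when the text contains no word from either list.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    female = sum(counts[w] for w in FEMALE_WORDS)
    male = sum(counts[w] for w in MALE_WORDS)
    total = female + male
    return female / total if total else None


def share_shift(original: str, generated: str) -> Optional[float]:
    """Female share of the generated text minus that of the original.

    A negative value means the generated text uses proportionally fewer
    female-specific words than the original article.
    """
    orig_share = female_share(original)
    gen_share = female_share(generated)
    if orig_share is None or gen_share is None:
        return None
    return gen_share - orig_share


if __name__ == "__main__":
    original = "She said her team won. He agreed."
    generated = "He said his team won. He agreed."
    print(share_shift(original, generated))
```

In this toy pair the original contains two female-specific words and one male-specific word, while the generated text contains only male-specific words, so the shift is negative. Averaging such shifts over many headline and article pairs would give a rough per-model indicator under these assumptions.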