Homogenization Effects of Large Language Models on Human Creative Ideation

Barrett R. Anderson, Jash Hemant Shah, Max Kreminski
DOI: https://doi.org/10.1145/3635636.3656204
2024-05-11
Abstract: Large language models (LLMs) are now being used in a wide variety of contexts, including as creativity support tools (CSTs) intended to help their users come up with new ideas. But do LLMs actually support user creativity? We hypothesized that the use of an LLM as a CST might make the LLM's users feel more creative, and even broaden the range of ideas suggested by each individual user, but also homogenize the ideas suggested by different users. We conducted a 36-participant comparative user study and found, in accordance with the homogenization hypothesis, that different users tended to produce less semantically distinct ideas with ChatGPT than with an alternative CST. Additionally, ChatGPT users generated a greater number of more detailed ideas, but felt less responsible for the ideas they generated. We discuss potential implications of these findings for users, designers, and developers of LLM-based CSTs.
Human-Computer Interaction, Artificial Intelligence
What problem does this paper attempt to address?
This paper examines whether large language models (LLMs), used as creativity support tools (CSTs), actually support human creative ideation, and in particular whether they lead different users to produce more homogeneous ideas. The researchers hypothesized that using an LLM as a CST might make users feel more creative and broaden the range of ideas each individual user suggests, while also making the ideas of different users more similar to one another. To test this hypothesis, they conducted a comparative user study with 36 participants using two CSTs: ChatGPT and a non-AI CST (Oblique Strategies cards). The study focused on the following research questions:

1. **RQ1**: At the group level, with which CST do participants generate more semantically similar ideas? (Answer: ChatGPT)
2. **RQ2**: At the individual level, with which CST do participants generate more semantically similar ideas? (Answer: No significant difference)
3. **RQ3**: Do participants using ChatGPT feel more or less responsible for their ideas? (Answer: Less responsible)
4. **RQ4**: Besides originality, do other creativity dimensions (such as fluency, flexibility, and elaboration) differ between the ChatGPT and non-ChatGPT conditions? (Answer: Participants using ChatGPT showed higher fluency, flexibility, and elaboration)

Through this study, the authors aim to understand the homogenization effects that LLMs may produce in human-AI co-creation, and to explore the potential causes of these effects and their implications for users, designers, and developers.
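The distinction between group-level similarity (RQ1) and individual-level similarity (RQ2) can be illustrated with a small sketch. This summary does not state how the paper measured semantic similarity, so everything below is an assumption made for illustration: the word-count vector is a toy stand-in for a real sentence embedding, and the names `bow_vector`, `cosine`, and `mean_similarity` are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def bow_vector(text):
    """Toy stand-in for a sentence embedding (assumption): lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def mean_similarity(ideas_by_user, across_users):
    """Mean pairwise cosine similarity over idea pairs.

    across_users=True  -> only pairs of ideas from different users (group level)
    across_users=False -> only pairs of ideas from the same user (individual level)
    Returns NaN when no pair qualifies.
    """
    tagged = [
        (user, bow_vector(idea))
        for user, ideas in ideas_by_user.items()
        for idea in ideas
    ]
    sims = [
        cosine(vec_a, vec_b)
        for (user_a, vec_a), (user_b, vec_b) in combinations(tagged, 2)
        if (user_a != user_b) == across_users
    ]
    return sum(sims) / len(sims) if sims else float("nan")


if __name__ == "__main__":
    # Hypothetical ideas, invented purely for this example.
    ideas = {
        "user_1": ["a toy robot that cleans rooms", "a kite shaped like a fish"],
        "user_2": ["a toy robot that sings songs", "a puzzle made of magnets"],
    }
    print("group level:", mean_similarity(ideas, across_users=True))
    print("individual level:", mean_similarity(ideas, across_users=False))
```

Under this framing, homogenization would show up as a higher group-level score in one condition than in the other, while the individual-level score stays comparable. A real analysis would replace `bow_vector` with a proper embedding model and apply an appropriate statistical test.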