Text-to-Image Generation: Perceptions and Realities

Jonas Oppenlaender, Aku Visuri, Ville Paananen, Rhema Linder, Johanna Silvennoinen
2023-05-02
Abstract: Generative AI is an emerging technology that will have a profound impact on society and individuals. Only a decade ago, it was thought that creative work would be among the last to be automated - yet today, we see AI encroaching on creative domains. In this paper, we present the key findings of a survey study on people's perceptions of text-to-image generation. We touch on participants' technical understanding of the emerging technology, their ideas for potential application areas, as well as concerns, risks, and dangers of text-to-image generation to society and the individual. The study found that participants were aware of the risks and dangers associated with the technology, but only a few participants considered the technology to be a risk to themselves. Additionally, those who had tried the technology rated its future importance lower than those who had not.
Human-Computer Interaction
What problem does this paper attempt to address?
The paper investigates how people perceive text-to-image generation technology. Specifically, the researchers conducted a survey to explore participants' technical understanding of this emerging technology, its potential application areas, and the risks and concerns it raises at both the societal and individual levels. The main objectives of the paper are:

1. **Technical Understanding**: Assessing participants' basic understanding of text-to-image generation technology, including whether they can distinguish between the training and inference processes.
2. **Potential Applications**: Identifying the fields in which participants believe this technology can be applied, such as creative arts, marketing, design, and entertainment.
3. **Risks and Concerns**: Exploring participants' views on the potential societal and individual risks posed by this technology, such as fake news, deepfakes, unemployment, and copyright infringement.
4. **Professional Importance**: Analyzing participants' views on the current and future importance of this technology in their professions, especially the differences between those who have tried the technology and those who have not.

Through these objectives, the researchers aim to reveal the public's multifaceted perceptions of text-to-image generation technology and to inform future policy-making and technological development.