Using Text-to-Image Generation for Architectural Design Ideation

Ville Paananen,Jonas Oppenlaender,Aku Visuri
DOI: https://doi.org/10.1177/14780771231222783
2023-04-20
Abstract:The recent progress of text-to-image generation has been recognized in architectural design. Our study is the first to investigate the potential of text-to-image generators in supporting creativity during the early stages of the architectural design process. We conducted a laboratory study with 17 architecture students, who developed a concept for a culture center using three popular text-to-image generators: Midjourney, Stable Diffusion, and DALL-E. Through standardized questionnaires and group interviews, we found that image generation could be a meaningful part of the design process when design constraints are carefully considered. Generative tools support serendipitous discovery of ideas and an imaginative mindset, enriching the design process. We identified several challenges of image generators and provided considerations for software development and educators to support creativity and emphasize designers' imaginative mindset. By understanding the limitations and potential of text-to-image generators, architects and designers can leverage this technology in their design process and education, facilitating innovation and effective communication of concepts.
Human-Computer Interaction,Artificial Intelligence,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to address the issue of how to utilize text-to-image generation technology to support creative and conceptual design in the early stages of architectural design. Specifically, the study focuses on the following aspects: 1. **How can text-to-image generators support creativity and concept generation in the early stages of architectural design?** - The study explores the application of text-to-image generators in the architectural design process, particularly in the initial stages of creativity and concept generation. 2. **What is the effectiveness of existing text-to-image generators in architectural design, and what improvements can future developers consider?** - The study evaluates the performance of existing text-to-image generators in architectural design and proposes suggestions for future developers. 3. **What are the typical challenges faced by novice users when using text-to-image generators?** - The study identifies the main issues encountered by novice users when using these tools and provides corresponding solutions. Through laboratory research, the authors investigated the process of 17 architecture students using three popular text-to-image generators—Midjourney, Stable Diffusion, and DALL-E—to design concepts for a cultural center. The results indicate that these generators can play a significant role in the design process, especially in inspiring creativity and imagination. However, the study also highlights some challenges in the practical application of these tools, such as issues with the quality and accuracy of generated images, and the need for users to possess certain skills to effectively use these tools.