Urban Visual Appeal According to ChatGPT: Contrasting AI and Human Insights

Milad Malekzadeh, Elias Willberg, Jussi Torkko, Tuuli Toivonen
2024-06-29
Abstract: The visual appeal of urban environments significantly impacts residents' satisfaction with their living spaces and their overall mood, which, in turn, affects their health and well-being. Given the resource-intensive nature of gathering evaluations on urban visual appeal through surveys or inquiries from residents, there is a constant quest for automated solutions to streamline this process and support spatial planning. In this study, we applied an off-the-shelf AI model to automate the analysis of urban visual appeal, using over 1,800 Google Street View images of Helsinki, Finland. By incorporating the GPT-4 model with specified criteria, we assessed these images. Simultaneously, 24 participants were asked to rate the images. Our results demonstrated a strong alignment between GPT-4 and participant ratings, although geographic disparities were noted. Specifically, GPT-4 showed a preference for suburban areas with significant greenery, contrasting with participants who found these areas less appealing. Conversely, in the city centre and densely populated urban regions of Helsinki, GPT-4 assigned lower visual appeal scores than participant ratings. While there was general agreement between AI and human assessments across various locations, GPT-4 struggled to incorporate contextual nuances into its ratings, unlike participants, who considered both context and features of the urban environment. The study suggests that leveraging AI models like GPT-4 allows spatial planners to gather insights into the visual appeal of different areas efficiently, aiding decisions that enhance residents' and travellers' satisfaction and mental health. Although AI models provide valuable insights, human perspectives are essential for a comprehensive understanding of urban visual appeal. This will ensure that planning and design decisions promote healthy living environments effectively.
Human-Computer Interaction, Artificial Intelligence, Computers and Society
What problem does this paper attempt to address?
The paper examines how well a multimodal large language model (MLLM) such as GPT-4 can automate the assessment of urban visual attractiveness, and where it falls short, by comparing its ratings with human evaluations. Specifically, the study aims to:
1. **Explore the potential of AI models in assessing urban visual attractiveness**: GPT-4 evaluates over 1,800 Google Street View images of Helsinki under three levels of prompt complexity, ranging from a simple overall visual attractiveness score to a comprehensive evaluation incorporating physical features, urban design quality, and subjective responses.
2. **Compare the consistency and differences between AI and human evaluations**: 24 participants (divided into resident and non-resident groups) rate the same set of images, and the study statistically analyzes the similarities and differences between the AI model's ratings and the human ratings, particularly in terms of geographic distribution.
3. **Analyze the performance of AI models in different urban areas**: Although the two sets of ratings align strongly overall, they diverge geographically. GPT-4 rates green suburban areas higher than participants do, and rates the city centre and densely populated urban areas lower than participants do.
4. **Explore the application prospects of AI models in urban planning**: Although AI models can efficiently generate environmental quality indicators, they still have limitations in understanding local context and integrating human perception. The study suggests that future models should be specifically trained to better understand and reflect human perceptions of urban elements, thereby supporting urban planning and design decisions.
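The comparison of AI and human ratings described above can be illustrated with a minimal sketch. It assumes a Spearman rank correlation as the agreement measure and a per-area mean of (model score minus human score) as the geographic gap; the summary does not state which statistics the authors used, so both choices are assumptions. The function names, area labels, and scores are hypothetical placeholders and are not data from the study.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def mean_gap_by_area(records):
    """Mean of (model score - human score) per area.

    records: iterable of (area, model_score, human_score) tuples.
    A positive gap means the model rated the area higher than people did.
    """
    gaps = {}
    for area, model, human in records:
        gaps.setdefault(area, []).append(model - human)
    return {area: mean(diffs) for area, diffs in gaps.items()}


# Hypothetical placeholder scores (NOT from the study), one tuple per image.
records = [
    ("suburb", 8, 6),
    ("suburb", 7, 5),
    ("centre", 5, 7),
    ("centre", 4, 6),
    ("centre", 6, 9),
]

model_scores = [r[1] for r in records]
human_scores = [r[2] for r in records]
print("rank agreement:", spearman(model_scores, human_scores))
print("mean gap by area:", mean_gap_by_area(records))
```

A rank-based measure is used here because appeal scores are ordinal; an analysis on real data would also need to account for several participants rating each image.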
In summary, by comparing the performance of AI and humans in evaluating urban visual attractiveness, the paper reveals the strengths and limitations of AI models, providing valuable insights for urban planning and design.