A graph neural network framework for spatial geodemographic classification

Stefano De Sabbata,Pengyuan Liu,Stefano De SabbataPengyuan Liua School of Geography,Geology and the Environment,University of Leicester,UKb School of Geographical Sciences,Nanjing University of Information Science and Technology,ChinaStefano De Sabbata is an Associate Professor of Geographical Information Science at the School of Geography,Geology and the Environment and a Fellow of the Institute for Digital Culture of the University of Leicester. Their research focuses on geographic data science and the application of artificial intelligence in human geography and digital culture studies.Pengyuan Liu is a lecturer (assistant professor level) at Nanjing University of Information Science and Technology. He holds an MSc and PhD degree from the University of Leicester in the United Kingdom.
DOI: https://doi.org/10.1080/13658816.2023.2254382
2023-10-04
International Journal of Geographical Information Science
Abstract:Geodemographic classifications are exceptional tools for geographic analysis, business and policy-making, providing an overview of the socio-demographic structure of a region by creating an unsupervised, bottom-up classification of its areas based on a large set of variables. Classic approaches can require time-consuming preprocessing of input variables and are frequently a-spatial processes. In this study, we present a groundbreaking, systematic investigation of the use of graph neural networks for spatial geodemographic classification. Using Greater London as a case study, we compare a range of graph autoencoder designs with the official London Output Area Classification and baseline classifications developed using spatial fuzzy c-means. The results show that our framework based on a Node Attributes-focused Graph AutoEncoder (NAGAE) can perform similarly to classic approaches on class homogeneity metrics while providing higher spatial clustering. We conclude by discussing the current limitations of the proposed framework and its potential to develop into a new paradigm for creating a range of geodemographic classifications, from simple, local ones to complex classifications able to incorporate a range of spatial relationships into the process.
geography, physical,computer science, information systems,information science & library science
What problem does this paper attempt to address?