Beyond satellite imagery: the influence of map representations on socio-economic prediction

David Koch, Simon Thaler, Zedong Zhang, Miroslav Despotovic
DOI: https://doi.org/10.1080/2150704x.2024.2359097
IF: 2.369
2024-06-19
Remote Sensing Letters
Abstract: Incorporating satellite imagery is crucial for remote sensing and computer vision in socio-economic studies. Machine learning techniques are typically used to extract valuable information from satellite images. This article presents an enhanced approach applying computer vision not only to satellite images but also to five different map sources, such as OpenStreetMap and building footprints. The goal is to determine if useful insights can be derived from simplified feature representations, improving the understanding of fundamental satellite imagery data. We conducted an experiment predicting the settlement patterns of university graduates in Vienna, using a convolutional neural network (CNN) to analyze grid cell images (250 m × 250 m) from satellites and five different maps. The model predicted five density classes of graduates, achieving an accuracy rate of 35.99% using building footprints, outperforming the 35.15% accuracy based on satellite images, while other map representations underperformed. These results suggest that building outlines and the open space between buildings contain vital predictive information. Our findings highlight the potential of this approach beyond socio-economic variables, demonstrating the capability of understanding maps via CNNs.
imaging science & photographic technology, remote sensing
What problem does this paper attempt to address?
This paper examines how different map representations affect socio-economic prediction, looking beyond satellite imagery alone. In addition to satellite images, the researchers used five map sources, including OpenStreetMap and building footprints, to test whether these simplified feature representations can yield useful insights into socio-economic factors. In an experiment on the settlement patterns of university graduates in Vienna, they used a convolutional neural network (CNN) to analyze images of 250 m × 250 m grid cells and predict five density classes of graduates. Prediction accuracy reached 35.99% with building footprints, slightly above the 35.15% obtained with satellite imagery, while the other map representations performed worse. This suggests that building outlines and the open space between buildings carry important predictive information, and it highlights the ability of CNNs to interpret maps, with potential applications beyond socio-economic variables.
The paper compares representations ranging from high-resolution satellite imagery to abstract maps such as building footprints and OpenStreetMap's detailed street layout, to assess how effective each is in socio-economic analysis. It finds that even the most simplified map representations, such as binary images containing only building outlines, may contain enough information for computer vision algorithms to predict socio-economic variables such as educational level or wealth distribution. The study also discusses how contrast, feature noise, and specific map elements (such as tram stations) affect prediction accuracy, and points to a relationship between green space, building density, and the residential choices of the educated population.
In summary, the paper emphasizes the value of considering multiple map data sources in socio-economic prediction, particularly for identifying geographic features related to demographic characteristics. It offers a new perspective on the population structure of urban areas and suggests directions for future research in map production and spatial data analysis.
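The experimental setup described above (250 m × 250 m grid cells, five density classes, accuracy as the metric) can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names are hypothetical, coordinates are assumed to be in a projected system measured in metres, and equal-frequency (quantile) binning is an assumption, since the abstract does not say how the five classes were defined. The CNN itself is omitted.

```python
from typing import List, Sequence, Tuple

CELL_SIZE_M = 250.0  # grid cell edge length stated in the abstract

Box = Tuple[float, float, float, float]


def grid_cells(xmin: float, ymin: float, xmax: float, ymax: float,
               size: float = CELL_SIZE_M) -> List[Box]:
    """Tile a bounding box (projected metres, assumed) into square cells."""
    cells = []
    y = ymin
    while y < ymax:
        x = xmin
        while x < xmax:
            cells.append((x, y, x + size, y + size))
            x += size
        y += size
    return cells


def quantile_classes(values: Sequence[float], n_classes: int = 5) -> List[int]:
    """Label each value 0..n_classes-1 by rank.

    Equal-frequency binning is an assumption made for illustration only.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    labels = [0] * len(values)
    for rank, idx in enumerate(order):
        labels[idx] = min(rank * n_classes // len(values), n_classes - 1)
    return labels


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of cells whose predicted class equals the true class."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
```

For scale: if the five classes were balanced (again an assumption), guessing uniformly at random would be correct about 20% of the time, which gives some context for the reported 35.99% and 35.15%.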