Modeling population distribution: A visual and quantitative analysis of gradient boosting and deep learning models for multi‐output spatial disaggregation

Marina Georgati, João Monteiro, Bruno Martins, Carsten Keßler, Henning Sten Hansen
DOI: https://doi.org/10.1111/tgis.13130
IF: 2.568
2024-01-11
Transactions in GIS
Abstract: Spatially aggregated data on socio‐demographic groups often fail to capture the population's spatial heterogeneity in cities. This poses challenges for urban planning, particularly when addressing the needs of groups such as migrants or families with children. Moreover, the commonly provided aggregated units, such as census tracts, vary in size and across data sources. Existing literature on disaggregation typically handles individual subgroups separately, ignoring their interrelations in the downscaling process. This article explores the potential of multi‐output regression models for simultaneous spatial downscaling of multiple groups and conducts a detailed spatial error analysis using individualized neighborhoods. We experiment with self‐training gradient‐boosting trees and fully convolutional neural networks, assessing the quality of results against ground truth data at the target resolution. We show that the evaluation of the disaggregated results at this detailed resolution requires unconventional methods. The methodology proves convenient and achieves high‐accuracy results using input datasets of building features.
geography