A multicriteria optimization framework for the definition of the spatial granularity of urban social media analytics

Sidgley Camargo de Andrade,Camilo Restrepo-Estrada,Luiz Henrique Nunes,Carlos Augusto Morales Rodriguez,Júlio Cézar Estrella,Alexandre Cláudio Botazzo Delbem,João Porto de Albuquerque
DOI: https://doi.org/10.1080/13658816.2020.1755039
2020-06-19
International Journal of Geographical Information Science
Abstract:<span>The spatial analysis of social media data has recently emerged as a significant source of knowledge for urban studies. Most of these analyses are based on an areal unit that is chosen without the support of clear criteria to ensure representativeness with regard to an observed phenomenon. Nonetheless, the results and conclusions that can be drawn from a social media analysis to a great extent depend on the areal unit chosen, since they are faced with the well-known Modifiable Areal Unit Problem. To address this problem, this article adopts a data-driven approach to determine the most suitable areal unit for the analysis of social media data. Our multicriteria optimization framework relies on the Pareto optimality to assess candidate areal units based on a set of user-defined criteria. We examine a case study that is used to investigate rainfall-related tweets and to determine the areal units that optimize spatial autocorrelation patterns through the combined use of indicators of global spatial autocorrelation and the variance of local spatial autocorrelation. The results show that the optimal areal units (30 km<sup>2</sup> and 50 km<sup>2</sup>) provide more consistent spatial patterns than the other areal units and are thus likely to produce more reliable analytical results.</span>
geography, physical,computer science, information systems,information science & library science
What problem does this paper attempt to address?