SGWR: similarity and geographically weighted regression
M. Naser Lessani,Zhenlong Li,M. Naser LessaniZhenlong LiGeoinformation and Big Data Research Laboratory,Department of Geography,The Pennsylvania State University,University Park,PA,USAM. Naser Lessani is currently a PhD student in the Department of Geography at The Pennsylvania State University. His primary research interests are geospatial big data analytics,human mobility,parallel spatial computing,Machine Learning,and GIS. He contributed to conceptualization of the research idea,data analysis,code development,and writing.Zhenlong Li is an Associate Professor in the Department of Geography and Director of the Geoinformation and Big Data Research Lab at The Pennsylvania State University. His primary research field is GIScience with a focus on geospatial big data analytics,spatial computing,and geospatial AI with applications to disaster management,human mobility,and public health. He contributed to conceptualization of the research idea,data analysis,and writing.
DOI: https://doi.org/10.1080/13658816.2024.2342319
2024-04-17
International Journal of Geographical Information Science
Abstract:Geographically weighted regression (GWR) offers a local approach to modeling spatial data, considering geographical location and spatial relationships between observations. A salient feature of GWR is the emphasis on geographical proximity, in accordance with Tobler's First Law of Geography, which assumes that closer entities have a greater influence on the target location. Traditional GWR models have been augmented to consider various forms of physical distances aimed at enhancing model performance, and they often disregarded the potential influence of other data attributes, a shortcoming that extends to most GWR extensions. In this study, we introduce a novel weight matrix construction, which integrates data attribute similarity alongside the conventional geographically weighted matrix. The two weights are integrated in a manner that results in improved model performance. The proposed model, called Similarity and Geographically Weighted Regression or SGWR, was applied to five distinct datasets: housing prices, crime rates, and three health outcomes including mental health, depression, and HIV. Results show that SGWR significantly improved model performance based on several statistical measures, outperforming the global regression model and the traditional GWR.
geography, physical,computer science, information systems,information science & library science