ST-ADPTC: a method for clustering spatiotemporal raster data based on improved density peak detection

Authors: Jie Song, Songshan Yue, Min Chen, Zhuo Sun, Yongning Wen, Lingzhi Sun

Affiliations:
a Key Laboratory of Virtual Geographic Environment, Nanjing Normal University, Ministry of Education, Nanjing, China
b State Key Laboratory Cultivation Base of Geographical Environment Evolution (Jiangsu Province), Nanjing Normal University, Nanjing, China
c Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing Normal University, Nanjing, China
d Wuhan Geomatics Institute, Wuhan, China
e The 3rd Geoinformation Mapping Institute of Ministry of Natural Resources, Chengdu, China

Jie Song received the Master's degree in geographical information science from Nanjing Normal University. He is currently an assistant engineer at Wuhan Geomatics Institute. His research interests include spatiotemporal pattern analysis and data mining.

Songshan Yue is currently an associate professor at Nanjing Normal University. His research interests include geographic information systems, geo-pattern mining, and open geographic modelling and simulation.

Min Chen is currently a professor at Nanjing Normal University. His research interests include geographic modelling, open geographic modelling and simulation, and virtual geographic environments.

Zhuo Sun is currently a PhD candidate at Nanjing Normal University. Her research interests focus on deep learning and geographical pattern mining.

Yongning Wen is currently a professor at Nanjing Normal University. His research interests include geographic modelling and virtual geographic environments.

Lingzhi Sun received the Master's degree in geographical information science from Nanjing Normal University. He is currently an assistant engineer at the 3rd Geoinformation Mapping Institute of Ministry of Natural Resources. His research interests include digital cartography and 3D visualization.
DOI: https://doi.org/10.1080/13658816.2024.2353703
2024-05-18
International Journal of Geographical Information Science
Abstract: Spatiotemporal raster (STR) data employ an array of grids to represent temporally varying and spatially distributed information, commonly utilized for recording environmental variables and socioeconomic indices. To reveal the geographic patterns embedded in STR data, the clustering by fast search and finding of density peaks (CFSFDP) algorithm is considered effective and suitable. However, this algorithm encounters limitations in identifying cluster centers, handling large data volumes, and measuring the coupled spatial-temporal-attribute distance when applied to STR data. To overcome these challenges, we propose an improved method named spatial temporal-adaptive density peak tree clustering (ST-ADPTC). This method leverages adaptive density peak tree segmentation to identify cluster centers and optimizes memory usage through the k-nearest neighbors (kNN) technique. By constructing a neighborhood that incorporates both spatiotemporal and thematic attribute similarities, ST-ADPTC computes the local density of STR data, facilitating the discovery of time-varying clusters. Based on the proposed method, we develop an open-source Python package (Geo_ADPTC). Experiments conducted using benchmarking datasets illustrate improvements in cluster identification and memory reduction. Additionally, a case study of sea surface temperature data demonstrates the feasibility and effectiveness of exploring spatial and temporal distribution patterns using the proposed method.
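For readers unfamiliar with density peak clustering, the following is a minimal sketch of the two quantities the CFSFDP family relies on: a local density (rho) and the distance to the nearest point of higher density (delta). It is not the Geo_ADPTC API and not the authors' implementation. The function name `knn_density_peaks`, the Gaussian-style kNN density, the plain Euclidean distance (rather than a coupled spatial-temporal-attribute distance), and the toy data are all assumptions made for illustration.

```python
import numpy as np

def knn_density_peaks(points, k=5):
    """Hypothetical helper: return (rho, delta, parent) for each point."""
    pts = np.asarray(points, dtype=float)
    n = len(pts)
    # Full pairwise distances: acceptable for a toy example, but this is
    # the O(n^2) memory cost that a kNN index would avoid on large rasters.
    dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
    # Distances to the k nearest neighbours (column 0 is the point itself).
    knn_dist = np.sort(dist, axis=1)[:, 1:k + 1]
    # Assumed density form: Gaussian-style kernel summed over the kNN.
    rho = np.exp(-(knn_dist ** 2)).sum(axis=1)
    # Visit points from densest to sparsest.
    order = np.argsort(-rho, kind="stable")
    delta = np.zeros(n)
    parent = np.full(n, -1)  # -1 marks the root (global density peak)
    for rank, i in enumerate(order):
        if rank == 0:
            # Convention: the densest point gets the largest distance.
            delta[i] = dist[i].max()
            continue
        denser = order[:rank]
        j = denser[np.argmin(dist[i, denser])]
        delta[i] = dist[i, j]
        parent[i] = j  # link to the nearest denser point
    return rho, delta, parent

# Toy data: two well-separated 2-D blobs of 30 points each.
rng = np.random.default_rng(0)
blob_a = rng.normal(loc=(0.0, 0.0), scale=0.3, size=(30, 2))
blob_b = rng.normal(loc=(5.0, 5.0), scale=0.3, size=(30, 2))
rho, delta, parent = knn_density_peaks(np.vstack([blob_a, blob_b]), k=5)

# Candidate centers: points where both rho and delta are large.
centers = np.argsort(-(rho * delta))[:2]
print(centers)
```

Here the `parent` links form a tree rooted at the densest point, which is one common way to read the term "density peak tree". Picking the top two `rho * delta` scores is a manual shortcut; the abstract indicates that ST-ADPTC instead identifies centers through adaptive segmentation of that tree.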
geography, physical; computer science, information systems; information science & library science