A similarity-based automatic data recommendation approach for geographic models.
Yunqiang Zhu,A-Xing Zhu,Min Feng,Jia Song,Hongwei Zhao,Jie Yang,Qiuyi Zhang,Kai Sun,Jinqu Zhang,Ling Yao
DOI: https://doi.org/10.1080/13658816.2017.1300805
2017-01-01
International Journal of Geographical Information Science
Abstract:The complexity of geographic modelling is increasing; hence, preparing data to drive geographic models is becoming a time-consuming and difficult task that may significantly hinder the application of such models. Meanwhile, a huge number of data sets have been shared and have become publicly accessible through the Internet. This study presents a data similarity-based approach to automatically recommend available data sets to fulfil the data requirements of geographic models. Unified description factors are adopted to provide a consistent description of public data sets and input data requirements of geographic models. Five elementary data similarities between them, specifically content, spatial coverage, temporal coverage, spatial precision, and temporal granularity similarities, are calculated. An overall similarity is estimated from aggregating the elementary data similarities. Thereafter, the candidate data for running the models are recommended in the order of overall data similarity. As a case study, the approach has been applied to recommend data from the China National Data Sharing Platform of Earth System Science to drive the population spatialization model PSM. The approach has successfully recommended the most related data sets to run PSM. The result also suggests that the data recommendation approach can facilitate the intelligent identification of geographic data and the building of links between the open data sets.