GeoImageNet: a multi-source natural feature benchmark dataset for GeoAI and supervised machine learning

Wenwen Li,Sizhe Wang,Samantha T. Arundel,Chia-Yu Hsu
DOI: https://doi.org/10.1007/s10707-022-00476-z
IF: 2.7729
2022-09-04
GeoInformatica
Abstract:The field of GeoAI or Geospatial Artificial Intelligence has undergone rapid development since 2017. It has been widely applied to address environmental and social science problems, from understanding climate change to tracking the spread of infectious disease. A foundational task in advancing GeoAI research is the creation of open, benchmark datasets to train and evaluate the performance of GeoAI models. While a number of datasets have been published, very few have centered on the natural terrain and its landforms. To bridge this gulf, this paper introduces a first-of-its-kind benchmark dataset, GeoImageNet, which supports natural feature detection in a supervised machine-learning paradigm. A distinctive feature of this dataset is the fusion of multi-source data, including both remote sensing imagery and DEM in depicting spatial objects of interest. This multi-source dataset allows a GeoAI model to extract rich spatio-contextual information to gain stronger confidence in high-precision object detection and recognition. The image dataset is tested with a multi-source GeoAI extension against two well-known object detection models, Faster-RCNN and RetinaNet. The results demonstrate the robustness of the dataset in aiding GeoAI models to achieve convergence and the superiority of multi-source data in yielding much higher prediction accuracy than the commonly used single data source.
computer science, information systems,geography, physical
What problem does this paper attempt to address?