Unsupervised machine learning with different sampling strategies and topographic factors for distinguishing between landslide source and runout areas to improve landslide inventory production

Jhe-Syuan Lai Jun-Yi Huang Hong-Mao Huang Yung-Chung Chuang a Department of Civil Engineering,Feng Chia University,Taichung,Taiwanb Institute of Earth Sciences,Academia Sinica,Taipei,Taiwanc Department of Urban Planning and Spatial Information,Feng Chia University,Taichung,Taiwan
DOI: https://doi.org/10.1080/19475705.2024.2406302
2024-09-26
Geomatics Natural Hazards and Risk
Abstract:This study derived 12 topographical and hydrological factors related to landslides from a 10-m digital elevation model. Three unsupervised machine learning algorithms were employed to distinguish between the features of landslide sources and runout areas for Typhoon Morakot. Two sampling strategies were designed in this study. The first strategy involved creating a data set by pairing landslide sources and runout areas on the basis of the size of polygonal areas recorded in the inventory and then exploring the effectiveness of feature separation with k-means++ as the primary method and EM (expectation maximization) and DBSCAN (Density-Based Spatial Clustering of Applications with Noise) as supplementary methods. Because of practical challenges associated with the inability to determine whether remote sensing results correspond to landslide sources or runout areas, the first sampling strategy was used to test the feasibility of the adopted algorithms. In the second sampling strategy, the effectiveness of feature separation was tested by dividing polygon samples into several intervals on the basis of their sizes, thereby verifying the practicability of the adopted algorithms in real-world operations. This study verified that combining unsupervised machine learning algorithms with topographic factors facilitates the effective separation of landslide sources and runout areas, thereby enhancing the quality of landslide inventories and reducing the cost of landslide inventory creation. Moreover, it indicated that meter-scale topographic data yield classification results similar to those obtained from manually created landslide inventories based on centimeter-scale stereo aerial photos. The proposed method effectively balances the cost and quality of landslide inventories.
geosciences, multidisciplinary,meteorology & atmospheric sciences,water resources
What problem does this paper attempt to address?