Uncertainties Analysis of Collapse Susceptibility Prediction Based on Remote Sensing and GIS: Influences of Different Data-Based Models and Connections between Collapses and Environmental Factors

Wenbin Li,Xuanmei Fan,Faming Huang,Wei Chen,Haoyuan Hong,Jinsong Huang,Zizheng Guo
DOI: https://doi.org/10.3390/rs12244134
IF: 5
2020-12-17
Remote Sensing
Abstract:To study the uncertainties of a collapse susceptibility prediction (CSP) under the coupled conditions of different data-based models and different connection methods between collapses and environmental factors, An’yuan County in China with 108 collapses is used as the study case, and 11 environmental factors are acquired by data analysis of Landsat TM 8 and high-resolution aerial images, using a hydrological and topographical spatial analysis of Digital Elevation Modeling in ArcGIS 10.2 software. Accordingly, 20 coupled conditions are proposed for CSP with five different connection methods (Probability Statistics (PSs), Frequency Ratio (FR), Information Value (IV), Index of Entropy (IOE) and Weight of Evidence (WOE)) and four data-based models (Analytic Hierarchy Process (AHP), Multiple Linear Regression (MLR), C5.0 Decision Tree (C5.0 DT) and Random Forest (RF)). Finally, the CSP uncertainties are assessed using the area under receiver operation curve (AUC), mean value, standard deviation and significance test, respectively. Results show that: (1) the WOE-based models have the highest AUC accuracy, lowest mean values and average rank, and a relatively large standard deviation; the mean values and average rank of all the FR-, IV- and IOE-based models are relatively large with low standard deviations; meanwhile, the AUC accuracies of FR-, IV- and IOE-based models are consistent but higher than those of the PS-based model. Hence, the WOE exhibits a greater spatial correlation performance than the other four methods. (2) Among all the data-based models, the RF model has the highest AUC accuracy, lowest mean value and mean rank, and a relatively large standard deviation. The CSP performance of the RF model is followed by the C5.0 DT, MLR and AHP models, respectively. (3) Under the coupled conditions, the WOE-RF model has the highest AUC accuracy, a relatively low mean value and average rank, and a high standard deviation. The PS-AHP model is opposite to the WOE-RF model. (4) In addition, the coupled models show slightly better CSP performances than those of the single data-based models not considering connect methods. The CSP performance of the other models falls somewhere in between. It is concluded that the WOE-RF is the most appropriate coupled condition for CSP than the other models.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
What problem does this paper attempt to address?
Based on the provided paper abstract and partial content, this research mainly attempts to solve the following problems: 1. **Uncertainty analysis**: Study the uncertainty of Collapse Susceptibility Prediction (CSP) under different data models and different connection methods. Specifically, the author explores the influence of the coupling conditions between different data models and different connection methods on the CSP results by analyzing these conditions. 2. **Selection of connection methods and data models**: Evaluate the performance of five different non - linear connection methods (Probability Statistics (PSs), Frequency Ratio (FR), Information Value (IV), Index of Entropy (IOE) and Weight of Evidence (WOE)) and four data models (Analytic Hierarchy Process (AHP), Multiple Linear Regression (MLR), C5.0 Decision Tree (C5.0 DT) and Random Forest (RF)) in Collapse Susceptibility Prediction to determine the most appropriate combination of connection methods and data models. 3. **Improve prediction accuracy**: Look for the best method to improve the accuracy of Collapse Susceptibility Prediction by evaluating the CSP performance under different model combinations. The research uses indicators such as Area Under Curve (AUC), mean, standard deviation and significance test to evaluate the performance of the models. ### Research background Collapse is a common geological disaster, which poses a serious threat to human life and property and causes environmental problems. Collapse Susceptibility Prediction (CSP) can effectively reflect the spatial probability of collapse occurrence in a certain area, but the uncertainty of CSP will increase the project construction risk and limit the land use in high - collapse - occurrence areas. Therefore, how to effectively carry out CSP has become one of the focuses of collapse research. ### Research methods 1. **Data collection**: The research selects Anyuan County in China as the case study area, where 108 collapse events have been recorded. Through the analysis of Landsat TM 8 and high - resolution aerial images, data of 11 environmental factors are obtained, including Digital Elevation Model (DEM), slope, aspect, profile curvature, plane curvature and terrain undulation, etc. 2. **Model construction**: Propose 20 different CSP modeling conditions, combining five different connection methods and four different data models. 3. **Performance evaluation**: Use methods such as AUC, mean, standard deviation and significance test to evaluate the CSP performance under different model combinations. ### Main findings 1. **Connection methods**: The WOE method shows the highest AUC accuracy, the lowest mean and average ranking among all connection methods, and has a relatively large standard deviation. The AUC accuracies of the FR, IV and IOE methods are the same and higher than that of the PS method. 2. **Data models**: The Random Forest (RF) model shows the highest AUC accuracy, the lowest mean and average ranking among all data models, and has a relatively large standard deviation. The performance of the C5.0 DT, MLR and AHP models decreases in turn. 3. **Optimal combination**: The WOE - RF model shows the highest AUC accuracy, a relatively low mean and average ranking, and a relatively high standard deviation among all combinations. The performance of the PS - AHP model is the opposite of that of the WOE - RF model. 4. **Advantages of the coupling model**: The CSP performance of the coupling model is slightly better than that of a single data model without considering the connection method. In conclusion, the research believes that the WOE - RF is the most suitable model combination for Collapse Susceptibility Prediction.