Interpretability of Statistical, Machine Learning, and Deep Learning Models for Landslide Susceptibility Mapping in Three Gorges Reservoir Area

Cheng Chen, Lei Fan
2024-05-29
Abstract: Landslide susceptibility mapping (LSM) is crucial for identifying high-risk areas and informing prevention strategies. This study investigates the interpretability of statistical, machine learning (ML), and deep learning (DL) models in predicting landslide susceptibility. It does so by applying several interpretation methods to two types of input factors: a comprehensive set of 19 contributing factors that are statistically relevant to landslides, and a dedicated set of 9 triggering factors directly associated with triggering landslides. Because model performance is a crucial metric in LSM, the investigation of interpretability also involves assessing and comparing LSM accuracy across the models considered. The convolutional neural network model achieved the highest accuracy (0.8447 with 19 factors; 0.8048 with 9 factors), while Extreme Gradient Boosting and Support Vector Machine also demonstrated strong predictive capabilities, outperforming conventional statistical models. These findings indicate that DL and sophisticated ML algorithms can effectively capture the complex relationships between input factors and landslide occurrence. However, the interpretability of predictions varied among models, particularly when using the broader set of 19 contributing factors. Explanation methods such as SHAP, LIME, and DeepLIFT also led to variations in interpretation results. Using the comprehensive set of 19 contributing factors improved prediction accuracy but introduced complexity and inconsistency in model interpretations. Focusing on the dedicated set of 9 triggering factors sacrificed some predictive power but enhanced interpretability, as evidenced by more consistent key factors identified across the models and by alignment with the findings of field investigation reports....
Machine Learning
What problem does this paper attempt to address?
The paper addresses the interpretability of statistical, machine learning (ML), and deep learning (DL) models for landslide susceptibility mapping (LSM) in the Three Gorges Reservoir area. Specifically, the study aims to:

1. **Evaluate and compare the performance of different models in predicting landslide susceptibility**: Using 19 landslide-related contributing factors and 9 direct triggering factors, the study measures the prediction accuracy of statistical, machine learning, and deep learning models.
2. **Explore the interpretability of different models**: The study applies several interpretation methods (such as SHAP, LIME, and DeepLIFT) to examine each model's global and local explanations, in order to understand how much each input factor contributes to the predicted occurrence of landslides.
3. **Balance accuracy and interpretability**: The study examines how the choice of input factors trades prediction accuracy against interpretability. Using the 19 contributing factors improves prediction accuracy but introduces complexity and inconsistency in the interpretation results, whereas using the 9 triggering factors sacrifices some predictive capability but enhances model interpretability.

In summary, the paper systematically evaluates and compares different models and interpretation methods in order to support the selection of LSM models that are both accurate and interpretable.
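The "consistency of key factors across models" discussed above can be made concrete with a simple overlap score. The following is a minimal sketch, not the paper's procedure: it assumes each model has already been reduced to one importance score per factor (for example, a mean absolute attribution from whichever explanation method is in use), takes each model's top-k factors, and averages the Jaccard overlap over all model pairs. The function names, model names, factor names, and scores are all hypothetical placeholders, not values from the study.

```python
from itertools import combinations


def top_k_factors(importances, k):
    """Return the names of the k factors with the largest absolute importance."""
    ranked = sorted(importances, key=lambda name: abs(importances[name]), reverse=True)
    return set(ranked[:k])


def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b| (1.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def mean_pairwise_overlap(per_model, k):
    """Average Jaccard overlap of top-k factor sets over all pairs of models."""
    tops = [top_k_factors(scores, k) for scores in per_model.values()]
    pairs = list(combinations(tops, 2))
    if not pairs:
        raise ValueError("need at least two models to compare")
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical importance scores, invented purely for illustration.
hypothetical_scores = {
    "model_1": {"factor_a": 0.40, "factor_b": 0.30, "factor_c": 0.20, "factor_d": 0.10},
    "model_2": {"factor_a": 0.35, "factor_b": 0.15, "factor_c": 0.30, "factor_d": 0.20},
    "model_3": {"factor_a": 0.45, "factor_b": 0.25, "factor_c": 0.10, "factor_d": 0.20},
}

print(mean_pairwise_overlap(hypothetical_scores, k=2))
```

A score near 1 means the models agree on which factors matter most, and a score near 0 means they largely disagree. Computing it once for a broad factor set and once for a reduced one would give a rough, single-number view of the trade-off described in point 3, although the choice of k and of the importance measure are assumptions that affect the result.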