Uncertainty Quantification of Soil Organic Carbon Estimation from Remote Sensing Data with Conformal Prediction

Nafiseh Kakhani, Setareh Alamdar, Ndiye Michael Kebonye, Meisam Amani, Thomas Scholten
DOI: https://doi.org/10.3390/rs16030438
IF: 5
2024-01-23
Remote Sensing
Abstract: Soil organic carbon (SOC) contents and stocks provide valuable insights into soil health, nutrient cycling, greenhouse gas emissions, and overall ecosystem productivity. Given this, remote sensing data coupled with advanced machine learning (ML) techniques have eased SOC level estimation while revealing its patterns across different ecosystems. However, despite these advances, the intricacies of training reliable and yet certain SOC models for specific end-users remain a great challenge. To address this, we need robust SOC uncertainty quantification techniques. Here, we introduce a methodology that leverages conformal prediction to address the uncertainty in estimating SOC contents while using remote sensing data. Conformal prediction generates statistically reliable uncertainty intervals for predictions made by ML models. Our analysis, performed on the LUCAS dataset in Europe and incorporating a suite of relevant environmental covariates, underscores the efficacy of integrating conformal prediction with another ML model, specifically random forest. In addition, we conducted a comparative assessment of our results against prevalent uncertainty quantification methods for SOC prediction, employing different evaluation metrics to assess both model uncertainty and accuracy. Our methodology showcases the utility of the generated prediction sets as informative indicators of uncertainty. These sets accurately identify samples that pose prediction challenges, providing valuable insights for end-users seeking reliable predictions in the complexities of SOC estimation.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve

This paper aims to address the issue of quantifying uncertainty in the estimation of soil organic carbon (SOC) content. Specifically, the authors propose a method using **Conformal Prediction** to assess the uncertainty of SOC estimation results based on remote sensing data. The main problems the paper attempts to solve are as follows:

1. **Improving the Reliability of SOC Estimation**:
   - Although remote sensing technology and machine learning methods have made progress in SOC estimation, training reliable and highly certain SOC models for specific user needs remains challenging.
   - Existing methods have uncertainties in SOC estimation, necessitating a reliable technique to quantify these uncertainties.
2. **Improvement of Uncertainty Quantification Techniques**:
   - Propose a new method that combines conformal prediction with machine learning models like random forests to generate statistically reliable uncertainty intervals.
   - This approach allows for more accurate identification of samples that are difficult to predict, thereby providing users with more reliable SOC estimation results.
3. **Comparison with Other Uncertainty Quantification Methods**:
   - The study not only introduces the conformal prediction method but also compares its results with other commonly used uncertainty quantification methods.
   - Different evaluation metrics are used to assess the model's uncertainty and accuracy, demonstrating the effectiveness of the prediction sets generated by conformal prediction as indicators of uncertainty.

Through this method, the paper aims to provide a new tool for the field of soil organic carbon estimation that is more transparent, interpretable, and performance-assured, thereby enhancing user confidence in the estimation results.
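
To make the idea of wrapping a point-prediction model with conformal prediction more concrete, the following is a minimal sketch, not the authors' implementation. It assumes the split (inductive) variant with absolute residuals as the nonconformity score, which this summary does not confirm. To stay dependency-free, a one-feature least-squares fit stands in for the paper's random forest, and the data are synthetic rather than LUCAS samples. All function names (`fit_line`, `conformal_quantile`, `split_conformal`, `make_data`) are hypothetical.

```python
import math
import random


def fit_line(xs, ys):
    """One-feature least squares; a simple stand-in for the paper's random forest."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    return lambda x: intercept + slope * x


def conformal_quantile(scores, alpha):
    """Finite-sample-corrected (1 - alpha) quantile of the calibration scores."""
    n = len(scores)
    rank = math.ceil((n + 1) * (1 - alpha))  # 1-based rank in the sorted scores
    if rank > n:
        return math.inf  # too few calibration points for this alpha
    return sorted(scores)[rank - 1]


def split_conformal(train, calib, alpha=0.1):
    """Fit on `train`, calibrate on `calib`, return x -> (lower, upper)."""
    predict = fit_line(*train)
    cal_x, cal_y = calib
    # Nonconformity score (assumed here): absolute residual on held-out data.
    scores = [abs(y - predict(x)) for x, y in zip(cal_x, cal_y)]
    q = conformal_quantile(scores, alpha)
    return lambda x: (predict(x) - q, predict(x) + q)


def make_data(n, rng):
    """Synthetic covariate/target pairs (hypothetical, not LUCAS data)."""
    xs = [rng.uniform(0.0, 10.0) for _ in range(n)]
    ys = [2.0 + 1.5 * x + rng.gauss(0.0, 1.0) for x in xs]
    return xs, ys


rng = random.Random(0)
train, calib, test = make_data(500, rng), make_data(500, rng), make_data(2000, rng)
interval = split_conformal(train, calib, alpha=0.1)

# Two common checks for interval quality: empirical coverage and mean width.
bounds = [interval(x) for x in test[0]]
coverage = sum(lo <= y <= hi for (lo, hi), y in zip(bounds, test[1])) / len(test[1])
mean_width = sum(hi - lo for lo, hi in bounds) / len(bounds)
print(f"empirical coverage: {coverage:.3f}, mean interval width: {mean_width:.3f}")
```

With `alpha=0.1`, the empirical coverage should land near the 90% target as long as calibration and test data are exchangeable. The calibration step only needs a `predict` function, so a random forest or any other regressor could replace the stand-in without changing the rest of the sketch.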