Prediction of Soil Organic Carbon Fractions in Tropical Cropland Using a Regional Visible and Near-Infrared Spectral Library and Machine Learning

Lingju Dai,Zheng Wang,Zhiqing Zhuo,Yuxin Ma,Zhou Shi,Songchao Chen
DOI: https://doi.org/10.1016/j.still.2024.106297
2025-01-01
Abstract:Soil organic carbon (SOC) is not a single and uniform entity, therefore understanding SOC fractions, particularly particulate organic carbon (POC) and mineral-associated organic carbon (MAOC), offers valuable insights into SOC dynamics. However, traditional laboratory measurements of SOC fractions are labor-intensive and costly. Therefore, leveraging rapid and cost-effective soil spectroscopy holds significant promise for addressing this challenge. While previous studies have concentrated on predicting SOC fractions using mid-infrared (MIR) spectroscopy, the potential of visible and near-infrared (VNIR) spectroscopy remains relatively unexplored, especially for tropical soils. To fill this gap, we evaluated six machine learning approaches, including three global models (Cubist, random forest (RF), partial least squares regression (PLSR)) and three local models (memory-based learning fitted by applying partial least squares regression (MBL-PLSR) and Gaussian process local regressions (MBL-GPR), non-linear memory-based learning (N-MBL)), for predicting POC and MAOC (g C kg(-1) soil) based on a regional soil VNIR spectral library (224 samples) from lateritic red soil in the tropical region of Guangdong Province, China. We also assessed the impact of variable selection on improving model performance by iteratively evaluating and removing insignificant predictor variables to determine the optimal number of predictors. The results showed that: (1) MBL-PLSR and N-MBL demonstrated commendable predictive performance, attaining coefficients of determination (R-2) of 0.73 and 0.72 for POC, and 0.53 and 0.55 for MAOC on the validation set, respectively, outperforming Cubist and PLSR; (2) variable selection simplified predictive models by identifying the best spectral bands, leading to improved predictive accuracy for both POC (R-2 increased from 0.68 to 0.73) and MAOC (R-2 increased from 0.49 to 0.55); (3) the overall predictive performance of VNIR spectroscopy was higher for POC (R-2 of 0.73) compared to MAOC (R-2 of 0.55), while MAOC could be predicted more accurately by subtracting POC predictions from SOC observations (R-2 of 0.73). The favorable predictive accuracy underscores VNIR spectroscopy's viability for POC predictions. Additionally, MAOC can be well predicted by subtracting the predicted POC from the measured SOC. The outcomes of this study offers valuable insights for predicting SOC fractions using VNIR spectroscopy.
What problem does this paper attempt to address?