High-spatial-resolution surface soil moisture retrieval using the Deep Forest model in the cloud environment over the Tibetan Plateau

Zhenghao Li, Qiangqiang Yuan, Xin Su
DOI: https://doi.org/10.1080/10095020.2024.2307931
IF: 4.278
2024-02-17
Geo-spatial Information Science
Abstract: As a key climate variable, soil moisture plays a crucial role in drought detection, flood warning, and crop yield prediction. In recent years, the demand for high-spatial-resolution soil moisture has increased, particularly in environmental management. In this study, Copernicus Sentinel-1 synthetic aperture radar data, Sentinel-2 multi-spectral data, and other auxiliary data (land cover types, soil texture, etc.) were used to retrieve surface soil moisture (10 m) in the cloud environment (Google Earth Engine + Google Colab + Google Drive) over the Tibetan Plateau, and an entirely data-driven machine learning-based model called Deep Forest was adopted. We discussed the application of the Deep Forest model and compared it with other machine learning models. Overall, on the basis of 10-fold cross-validation, the modified Deep Forest model performed the best, with a coefficient of determination (R²) of 0.834 and an unbiased Root Mean Square Error (ubRMSE) of 0.038 m³·m⁻³. It also demonstrated the best performance in site-based validation (R² of 0.606 and ubRMSE of 0.092 m³·m⁻³). In addition, the framework for the data acquisition, data preprocessing, model training, and soil moisture mapping in this study was completed in the cloud environment, which facilitated the entire retrieval process. This work provides new ideas beyond the retrieval model for other related studies.
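The two accuracy metrics quoted in the abstract can be illustrated with a minimal sketch. It assumes the standard definitions (R² = 1 − SS_res/SS_tot, and ubRMSE² = RMSE² − bias²); the abstract does not spell out its formulas, so this may differ in detail from the paper's own computation. The function names and sample values are hypothetical.

```python
import math

def r2_score(obs, est):
    """Coefficient of determination: 1 - SS_res / SS_tot (assumed definition)."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((o - e) ** 2 for o, e in zip(obs, est))
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot

def ubrmse(obs, est):
    """Unbiased RMSE: the RMSE that remains after removing the mean bias."""
    n = len(obs)
    bias = sum(e - o for o, e in zip(obs, est)) / n
    mse = sum((e - o) ** 2 for o, e in zip(obs, est)) / n
    # ubRMSE^2 = RMSE^2 - bias^2; the clamp guards against negative round-off.
    return math.sqrt(max(mse - bias ** 2, 0.0))

# Hypothetical soil moisture values in m^3/m^3, for illustration only.
observed = [0.10, 0.20, 0.30, 0.40]
estimated = [0.15, 0.25, 0.35, 0.45]  # every estimate is offset by +0.05

print("R2:", r2_score(observed, estimated))
print("ubRMSE:", ubrmse(observed, estimated))
```

Because the error in this toy example is a pure constant offset, the ubRMSE is essentially zero while R² is still penalized, which shows why the two metrics are reported together.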
remote sensing
What problem does this paper attempt to address?
This paper addresses the retrieval of high-spatial-resolution (10 m) surface soil moisture over the Tibetan Plateau. Specifically, the researchers used Copernicus Sentinel-1 synthetic aperture radar data, Sentinel-2 multispectral data, and other auxiliary data (such as land cover types and soil texture), and adopted a fully data-driven machine learning model, Deep Forest, in the cloud environment (Google Earth Engine + Google Colab + Google Drive) to retrieve surface soil moisture. The main objectives of the paper are:
1. **Improve spatial resolution**: Use high-resolution satellite data and advanced machine learning methods to raise the spatial resolution of surface soil moisture products so that they meet application requirements at the local scale.
2. **Evaluate model performance**: Compare the Deep Forest model with other common machine learning models (K-Nearest Neighbor, Support Vector Machine, Random Forest, Gradient Boosting Regression Tree, and Generalized Regression Neural Network) for surface soil moisture retrieval.
3. **Construct a cloud-environment framework**: Develop a fully cloud-based framework for data acquisition, preprocessing, model training, and soil moisture mapping to simplify the entire retrieval process.
Through these objectives, the research aims to provide an efficient, accurate, and easy-to-implement method for retrieving high-spatial-resolution surface soil moisture, which is especially relevant to fields such as environmental management.
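The model comparison described above rests on 10-fold cross-validation. As a rough sketch of that procedure only, the example below compares a toy k-nearest-neighbor regressor against a mean-value baseline on synthetic data. It is not the authors' Deep Forest pipeline: the data, the single feature, and all function names are assumptions made for illustration.

```python
import random

def k_fold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and deal them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def knn_predict(train_x, train_y, x, k=3):
    """Average the targets of the k nearest training points (one feature)."""
    nearest = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))[:k]
    return sum(train_y[i] for i in nearest) / len(nearest)

def mean_predict(train_x, train_y, x):
    """Baseline model: always predict the mean of the training targets."""
    return sum(train_y) / len(train_y)

def cross_validate(xs, ys, predict, k=10):
    """Return the RMSE over pooled out-of-fold predictions."""
    sq_err = 0.0
    for held_out in k_fold_indices(len(xs), k):
        held = set(held_out)
        tr_x = [xs[i] for i in range(len(xs)) if i not in held]
        tr_y = [ys[i] for i in range(len(xs)) if i not in held]
        # Each sample is predicted by a model that never saw it in training.
        for i in held_out:
            sq_err += (predict(tr_x, tr_y, xs[i]) - ys[i]) ** 2
    return (sq_err / len(xs)) ** 0.5

# Synthetic data: one hypothetical predictor and a noisy linear response.
rng = random.Random(42)
xs = [i / 100 for i in range(100)]
ys = [0.1 + 0.3 * x + rng.gauss(0, 0.01) for x in xs]

rmse_knn = cross_validate(xs, ys, knn_predict)
rmse_mean = cross_validate(xs, ys, mean_predict)
print("10-fold RMSE, kNN:", rmse_knn)
print("10-fold RMSE, mean baseline:", rmse_mean)
```

Using the same folds for every candidate model keeps the comparison fair, since each model is scored on identical held-out samples.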