A China dataset of soil properties for land surface modeling (version 2)

Gaosong Shi,Wenye Sun,Wei Shangguan,Zhongwang Wei,Hua Yuan,Ye Zhang,Hongbin Liang,Lu Li,Xiaolin Sun,Danxi Li,Feini Huang,Qingliang Li,Yongjiu Dai
DOI: https://doi.org/10.5194/essd-2024-299
IF: 11.4
2024-09-01
Earth System Science Data
Abstract:Accurate and high-resolution spatial soil information is crucial for efficient and sustainable land use, management, and conservation. Since the establishment of digital soil mapping (DSM) and the GlobalSoilMap working group, significant advances have been made in spatial soil information globally. However, accurately predicting soil variation over large and complex areas with limited samples remains a challenge, especially for China, which has diverse soil landscapes. To address this challenge, we utilized 11,209 representative multi-source legacy soil profiles (including the Second National Soil Survey of China, World Soil Information Service, First National Soil Survey of China, and regional databases) and high-resolution soil-forming environment characterization. Using advanced Quantile Regression Forest algorithms and a high-performance parallel computing strategy, we developed comprehensive maps of 23 soil physical, chemical and fertility properties at six standard depth layers from 0 to 2 meters in China with a 90 m spatial resolution (China dataset of soil properties for land surface modeling version 2, CSDLv2). Data-splitting and independent samples validation strategies were employed to evaluate the accuracy of the predicted maps quality. The results showed that the predicted maps were significantly more accurate and detailed compared to traditional soil type linkage methods (i.e., CSDLv1, the first version of the dataset), SoilGrids 2.0, and HWSD 2.0 products, effectively representing the spatial variation of soil properties across China. The prediction accuracy of most soil properties at the 0–5 cm depth interval ranged from good to moderate, with Model Efficiency Coefficients for most soil properties ranging from 0.75 to 0.32 during data-splitting validation and from 0.88 to 0.25 during independent sample validation. The wide range between the 5 % lower and 95 % upper prediction limits may indicate substantial room for improvement in current predictions. The relative importance of environmental covariates in predictions varied with soil properties and depth, indicating the complexity of interactions among multiple factors in the soil formation processes. As the soil profiles used in this study mainly originate from the Second National Soil Survey of China during 1970s and 1980s, they could provide new perspectives of soil changes together with existing maps based on 2010s soil profiles. The findings make important contributions to the GlobalSoilMap project and can also be used for regional Earth system modeling and land surface modeling to better represent the role of soil in hydrological and biogeochemical cycles in China. This dataset is freely available and can be accessed at https://doi.org/10.11888/Terre.tpdc.301235 (Shi et al, 2024).
geosciences, multidisciplinary,meteorology & atmospheric sciences
What problem does this paper attempt to address?