Selection of environmental variables and their scales in multiple soil properties mapping: A case study in Heilongjiang Heshan Farm

SHI Jingjing,YANG Lin,ZENG Canying,ZHU Axing,QIN Chengzhi,LIANG Peng
DOI: https://doi.org/10.11821/dlyj201803014
2018-01-01
Geographical Research
Abstract:Studying the relevant environmental variables with consideration of scales for different soil properties is meaningful to understand the generation and development of soil properties,and also necessary in multiple soil properties mapping and sampling.This study explored multiple soil properties' relevant environmental variables and their scales,andexamined the impact of different environmental variables and their scales on the prediction of different soil properties.Our study area is Heshan Farm,and the target soil properties are topsoil clay content,sand content,silt content,topsoil organic matter content (SOM),and soil depth.One hundred and seventy-three multi-scale terrain variables were generated by changing neighborhood size for calculation.The single scale and multi-scale variables were ranked according to their variable importance calculated by Random Forest.Subsets 1 and 2 were selected from single scale and multi-scale variables respectively based on their variable importance with elimination of multi-collinearity.Subset 3 was taken as a reference subset andselected based on the expert knowledge.The selected subset 1 had little common with subset 3.This indicates that the environmental variables selected based on expert knowledge may be not the most important variables for the soil properties.Subset 2 had a high overlap with subset 3 though the scales were different for different environmental variables and soil properties.For the case of soil sand and silt,their relevant variables and scales were similar but quite different from soil clay's,and the SOM and soil depth had similar relevant variables.The mapping results based on the three subsets showed that using environmental variables in subset 1 was more accurate than using environmental variables in subset 3 for all soil properties except for sand content,the improvements of mean RMSEs were 1.8%~13.1%.Using environmental variables in subset 2 was more accurate than using environmental variables in subsets 1 and 3 for all the five soil properties,the improvements of mean RMSEs were 8.7%~16.5% and 7.8%~ 21.3%.It was shown that using reference variables with proper scales is more important than using top-ranked single scale variables for mapping.
What problem does this paper attempt to address?