Extracting Typical Samples Based on Image Environmental Factors to Obtain an Accurate and High-Resolution Soil Type Map

Changda Zhu, Fubin Zhu, Cheng Li, Yunxin Yan, Wenhao Lu, Zihan Fang, Zhaofu Li, Jianjun Pan
DOI: https://doi.org/10.3390/rs16071128
IF: 5
2024-03-24
Remote Sensing
Abstract: Soil surveying and mapping provide important support for environmental science research on soil and other resources. Due to rapid changes in land use and the long update cycle of soil maps, historical conventional soil maps (CSMs) may be outdated and have low accuracy. Therefore, there is an urgent need for accurate and up-to-date soil maps. Soil is highly correlated in space with its corresponding environmental factors, and typical samples capture an appropriate soil–environment relationship for each soil type. Understanding how to extract typical samples according to environmental factors and determine the implied soil–environment relationship is the key to updating soil maps. In this study, a hierarchical typical sample extraction method based on land use type and environmental factors was designed. According to the corresponding relationship between the soil type and the land use type (ST-LU), the outdated soil map patches caused by changes in land use were excluded, followed by the extraction of typical samples according to the peak intervals of the soil–environmental factor histograms. Feature selection was then performed through variance analysis and mutual information, and four machine learning models were used to predict soil types. The influence of environmental factors on soil prediction was also discussed in terms of variable importance analysis. Using an overall common validation set, the results show that the prediction accuracy using typical samples for learning in the modeling set is above 0.8, while the prediction accuracy when using random samples is only about 0.4. Compared with the original soil map, the accuracy and resolution of the predicted soil maps based on typical samples are greatly improved. In general, typical samples can effectively explore the actual soil–environment knowledge implied in the soil type map. By extracting typical samples from historical soil type maps and combining them with high-resolution remote sensing data, we can generate new soil type maps with high accuracy and a short update cycle. This can provide some references for typical sampling design and soil type prediction.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
### What problem does this paper attempt to solve?

This paper aims to solve the problem that Conventional Soil Maps (CSMs) become obsolete and imprecise due to land-use change and long update cycles. Specifically, the paper proposes a method of extracting typical samples based on land-use types and environmental factors to generate soil-type maps with high precision and high resolution.

### Main research contents of the paper

1. **Background and motivation**:
   - Traditional soil survey and mapping methods rely on topographic maps and aerial or satellite images, and experts understand soil-landscape relationships through field surveys. Although these methods are reliable, they are labor-intensive, require specialist expertise, and are limited by technology, cost and time, resulting in long update cycles.
   - Historical Conventional Soil Maps (CSMs) may be decades old and thus need to be updated to reflect the current soil conditions.
2. **Research methods**:
   - **Typical sample extraction**: According to the relationship between soil types and land-use types (ST-LU), obsolete soil-map patches caused by land-use change are excluded. Then, the typical areas are determined by analyzing the peak intervals of the distribution histograms of soil-environmental factors.
   - **Feature selection**: Key environmental variables are selected through analysis of variance and mutual information.
   - **Machine-learning models**: Four machine-learning models (random forest, bagged classification and regression trees, bagged flexible discriminant analysis, neural network) are used for soil-type prediction, and the influence of environmental factors on soil prediction is discussed through variable importance analysis.
3. **Results and analysis**:
   - The prediction accuracy of modeling with typical samples is higher than 0.8, while the prediction accuracy of using random samples is only about 0.4.
   - Compared with the original soil map, the predicted soil map based on typical samples is significantly improved in terms of accuracy and resolution.
   - Typical samples can effectively explore the actual soil-environmental knowledge implicit in the soil-type map.

### Conclusion

By extracting typical samples from historical soil-type maps and combining them with high-resolution remote-sensing data, new soil-type maps with high precision and short update cycles can be generated. This provides a reference for the update of traditional soil maps and modern digital soil mapping.
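
The typical sample extraction step described under the research methods can be illustrated with a short Python sketch. This is not the authors' implementation: the summary does not say how a "peak interval" is delimited, so the sketch assumes it is the contiguous run of histogram bins around the modal bin whose counts stay at or above a fraction of the peak count. The function names `peak_interval` and `typical_mask`, the parameters `bins` and `frac`, and the synthetic elevation and slope data are all hypothetical, and the ST-LU exclusion step is assumed to have been applied beforehand.

```python
import numpy as np


def peak_interval(values, bins=10, frac=0.5):
    """Return (low, high) bounds of the contiguous histogram bins around the
    modal bin whose counts are >= frac * peak count.
    `bins` and `frac` are illustrative assumptions, not values from the paper."""
    counts, edges = np.histogram(values, bins=bins)
    peak = int(np.argmax(counts))
    threshold = frac * counts[peak]
    lo = peak
    while lo > 0 and counts[lo - 1] >= threshold:
        lo -= 1
    hi = peak
    while hi < len(counts) - 1 and counts[hi + 1] >= threshold:
        hi += 1
    return edges[lo], edges[hi + 1]


def typical_mask(factors, soil_types, bins=10, frac=0.5):
    """Flag samples that lie inside the peak interval of every environmental
    factor, computed separately for each soil type.
    factors: (n_samples, n_factors) array; soil_types: (n_samples,) labels."""
    factors = np.asarray(factors, dtype=float)
    soil_types = np.asarray(soil_types)
    mask = np.zeros(len(soil_types), dtype=bool)
    for soil in np.unique(soil_types):
        idx = soil_types == soil
        keep = np.ones(int(idx.sum()), dtype=bool)
        for j in range(factors.shape[1]):
            col = factors[idx, j]
            low, high = peak_interval(col, bins, frac)
            # Both bounds are treated as inclusive for simplicity.
            keep &= (col >= low) & (col <= high)
        mask[idx] = keep
    return mask


# Synthetic stand-in data: two soil types with different elevation and slope.
rng = np.random.default_rng(0)
elevation = np.concatenate([rng.normal(100, 5, 500), rng.normal(300, 5, 500)])
slope = np.concatenate([rng.normal(2, 0.5, 500), rng.normal(15, 2, 500)])
labels = np.array(["A"] * 500 + ["B"] * 500)

mask = typical_mask(np.column_stack([elevation, slope]), labels)
print(f"{mask.sum()} of {mask.size} samples kept as typical")
```

Samples selected by `mask` would then serve as the modeling set for the classifiers, in place of randomly drawn samples. Requiring every factor to fall in its peak interval is a strict choice; a looser rule (for example, a minimum number of factors) would keep more samples at the cost of less typical ones.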