Mapping the Growing Stem Volume of the Coniferous Plantations in North China Using Multispectral Data from Integrated GF-2 and Sentinel-2 Images and an Optimized Feature Variable Selection Method

Xinyu Li, Hui Lin, Jiangping Long, Xiaodong Xu
DOI: https://doi.org/10.3390/rs13142740
IF: 5
2021-07-12
Remote Sensing
Abstract: Accurate measurement of forest growing stem volume (GSV) is important for forest resource management and ecosystem dynamics monitoring. Optical remote sensing imagery has great application prospects in forest GSV estimation on regional and global scales because it is easily accessible, covers wide areas, and relies on mature technology. However, its application is limited by cloud coverage, data stripes, atmospheric effects, and satellite sensor errors. Combining multi-sensor data can reduce such limitations by increasing data availability, but it also raises the dimensionality of the feature space, which makes feature selection more difficult. In this study, GaoFen-2 (GF-2) and Sentinel-2 images were integrated, and feature variables and data scenarios were derived by a proposed adaptive feature variable combination optimization (AFCO) program for estimating the GSV of coniferous plantations. The AFCO algorithm was compared to four traditional feature variable selection methods, namely, random forest (RF), stepwise random forest (SRF), the fast iterative feature selection method for k-nearest neighbors (KNN-FIFS), and the feature variable screening and combination optimization procedure based on the distance correlation coefficient and k-nearest neighbors (DC-FSCK). The comparison indicated that the AFCO program not only considered the combination effect of feature variables, but also optimized the selection of the first feature variable, the error threshold, and the choice of estimation model. Furthermore, we selected feature variables from three datasets (GF-2, Sentinel-2, and the integrated data) using AFCO and the four other feature selection methods, and used k-nearest neighbors (KNN) and random forest regression (RFR) to estimate the GSV of coniferous plantations in northern China.
The results indicated that the integrated data improved the GSV estimation accuracy of coniferous plantations, with relative root mean square errors (RMSErs) of 15.0% and 19.6%, which were lower than those of GF-2 and Sentinel-2 data, respectively. In particular, the texture feature variables derived from the GF-2 red band image had a significant impact on the GSV estimation performance of the integrated dataset. For most data scenarios, the AFCO algorithm produced more accurate GSV estimates, with RMSErs that were 30.0%, 23.7%, 17.7%, and 17.5% lower than those of RF, SRF, KNN-FIFS, and DC-FSCK, respectively. The GSV distribution map obtained with the AFCO method and the RFR model matched the field observations well. This study provides some insight into the application of optical images, the optimization of feature variable combinations, and the selection of modeling algorithms for estimating the GSV of coniferous plantations.
environmental sciences; imaging science & photographic technology; remote sensing; geosciences, multidisciplinary
What problem does this paper attempt to address?
This paper aims to improve the estimation accuracy of growing stem volume (GSV) for coniferous forests in the temperate monsoon climate zone of northern China by using fused data from GaoFen-2 (GF-2) and Sentinel-2 satellite images, combined with an optimized feature variable selection method. Specifically:

1. **Data Fusion and Feature Extraction**: The study integrates multispectral images from GF-2 and Sentinel-2 and extracts candidate feature variables such as vegetation indices and texture features.
2. **Feature Variable Selection Algorithm**: An adaptive feature variable combination optimization (AFCO) procedure is proposed, which not only considers the combinatorial effects between feature variables but also optimizes the selection of the first feature variable, the error threshold, and the choice of estimation model.
3. **Model Comparison**: The AFCO method is compared with four traditional feature selection methods: random forest (RF), stepwise random forest (SRF), fast iterative feature selection for k-nearest neighbors (KNN-FIFS), and feature variable screening and combination optimization based on the distance correlation coefficient and k-nearest neighbors (DC-FSCK). Two machine learning models, k-nearest neighbors (KNN) regression and random forest regression (RFR), are used to estimate GSV.

Through these methods, the paper validates the effectiveness of fused data in improving GSV estimation accuracy and demonstrates that the AFCO method yields more accurate GSV estimates in most data scenarios.
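To make the idea of error-driven feature combination concrete, the following is a minimal sketch of a generic greedy forward selection scored by relative RMSE under leave-one-out KNN regression. It is not the AFCO procedure, whose internals the abstract does not describe. The function names (`relative_rmse`, `knn_loo_predict`, `forward_select`), the `min_gain` stopping threshold, the choice of `k=3`, and the synthetic data are all assumptions made for illustration. RMSEr is taken here as RMSE divided by the mean observed value, expressed in percent.

```python
import math
import random


def relative_rmse(observed, predicted):
    """Relative RMSE in percent: RMSE divided by the mean observed value."""
    n = len(observed)
    rmse = math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)
    return 100.0 * rmse / (sum(observed) / n)


def knn_loo_predict(rows, targets, features, k=3):
    """Leave-one-out KNN regression restricted to the given feature columns.

    Assumes the features are already on comparable scales.
    """
    predictions = []
    for i, row in enumerate(rows):
        distances = []
        for j, other in enumerate(rows):
            if j == i:
                continue  # hold out the sample being predicted
            d = sum((row[f] - other[f]) ** 2 for f in features)
            distances.append((d, targets[j]))
        distances.sort(key=lambda pair: pair[0])
        nearest = distances[:k]
        predictions.append(sum(t for _, t in nearest) / len(nearest))
    return predictions


def forward_select(rows, targets, k=3, min_gain=0.1):
    """Greedily add the feature that lowers leave-one-out RMSEr the most.

    Stops when the best candidate improves RMSEr by less than `min_gain`
    percentage points (a hypothetical error threshold).
    """
    n_features = len(rows[0])
    selected, best_err = [], float("inf")
    while len(selected) < n_features:
        trials = []
        for f in range(n_features):
            if f in selected:
                continue
            preds = knn_loo_predict(rows, targets, selected + [f], k)
            trials.append((relative_rmse(targets, preds), f))
        err, f = min(trials)
        if best_err - err < min_gain:
            break
        selected.append(f)
        best_err = err
    return selected, best_err


# Synthetic stand-in for plot data: column 0 drives the target,
# columns 1 and 2 are pure noise.
random.seed(0)
rows = [[random.uniform(0, 1) for _ in range(3)] for _ in range(60)]
targets = [100 + 200 * r[0] + random.gauss(0, 5) for r in rows]

selected, err = forward_select(rows, targets)
print("selected feature columns:", selected)
print("leave-one-out RMSEr (%):", round(err, 2))
```

Because each candidate is scored together with the features already chosen, the selection reflects how features perform in combination rather than individually. The same loop could be scored with a different regressor in place of KNN.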