Research on Estimation Model of Carbon Stock Based on Airborne LiDAR and Feature Screening

Xuan Liu,Ruirui Wang,Wei Shi,Xiaoyan Wang,Yaoyao Yang

DOI: https://doi.org/10.3390/su16104133

IF: 3.9

2024-05-16

Sustainability

Abstract:The rapid and accurate estimation of forest carbon stock is important for analyzing the carbon cycle. In order to obtain forest carbon stock efficiently, this paper utilizes airborne LiDAR data to research the applicability of different feature screening methods in combination with machine learning in the carbon stock estimation model. First, Spearman's Correlation Coefficient (SCC) and Extreme Gradient Boosting tree (XGBoost) were used to screen out the variables that were extracted via Airborne LiDAR with a higher correlation with carbon stock. Then, Bagging, K-nearest neighbor (KNN), and Random Forest (RF) were used to construct the carbon stock estimation model. The results show that the height statistical variable is more strongly correlated with carbon stocks than the density statistical variables are. RF is more suitable for the construction of the carbon stock estimation model compared to the instance-based KNN algorithm. Furthermore, the combination of the XGBoost algorithm and the RF algorithm performs best, with an R2 of 0.85 and an MSE of 10.74 on the training set and an R2 of 0.53 and an MSE of 21.81 on the testing set. This study demonstrates the effectiveness of statistical feature screening methods and Random Forest for carbon stock estimation model construction. The XGBoost algorithm has a wider applicability for feature screening.

environmental sciences,environmental studies,green & sustainable science & technology

What problem does this paper attempt to address?

The main objective of this paper is to study the application effect of airborne LiDAR data and feature selection methods in forest carbon stock estimation models. Specifically, the authors aim to: 1. **Utilize airborne LiDAR data**: Extract variables highly related to forest carbon stock, such as statistical variables like average height and maximum height. 2. **Application of feature selection methods**: Use Spearman correlation coefficient (SCC) and Extreme Gradient Boosting (XGBoost) algorithms to screen the extracted LiDAR features to identify variables highly related to carbon stock. 3. **Construct carbon stock estimation models**: Build carbon stock estimation models using three machine learning methods—Bagging, K-Nearest Neighbors (KNN), and Random Forest (RF)—and compare the performance of these models. 4. **Evaluate model performance**: Assess model performance using R² values and Mean Squared Error (MSE) on training and testing sets to determine which combination of feature selection methods and machine learning models is most suitable for carbon stock estimation. The research results of the paper indicate that height statistical variables have a stronger correlation with carbon stock than density statistical variables; among the tested models, the Random Forest (RF) model performed the best, especially when combined with XGBoost feature selection. Additionally, the study validated the effectiveness of statistical feature selection methods and the applicability of the Random Forest algorithm in constructing carbon stock estimation models.

Research on Estimation Model of Carbon Stock Based on Airborne LiDAR and Feature Screening

Retrieval of forest growing stock volume by two different methods using Landsat TM images

Estimation of Bamboo Forest Aboveground Carbon Using the RGLM Model Based on Object-Based Multiscale Segmentation of SPOT-6 Imagery

Remote Sensing Estimation of Forest Carbon Stock Based on Machine Learning Algorithms

Combining Sample Plot Stratification and Machine Learning Algorithms to Improve Forest Aboveground Carbon Density Estimation in Northeast China Using Airborne LiDAR Data

Estimating Forest Stock Volume Based on Airborne Lidar Data

Estimation of Forest Stock Volume Combining Airborne LiDAR Sampling Approaches with Multi-Sensor Imagery

Estimating Aboveground Carbon Stock at the Scale of Individual Trees in Subtropical Forests Using UAV LiDAR and Hyperspectral Data

Satellite Image Fusion Airborne LiDAR Point-Clouds-Driven Machine Learning Modeling to Predict the Carbon Stock of Typical Subtropical Plantation in China

Comparison of Multiple Machine Learning Models for Estimating the Forest Growing Stock in Large-Scale Forests Using Multi-Source Data

Urban carbon stock estimation based on deep learning and UAV remote sensing: a case study in Southern China

Research on Estimating and Evaluating Subtropical Forest Carbon Stocks by Combining Multi-Payload High-Resolution Satellite Data

Advancing forest carbon stocks' mapping using a hierarchical approach with machine learning and satellite imagery

Estimation of Forest Aboveground Biomass and Leaf Area Index Based on Digital Aerial Photograph Data in Northeast China

Forest Aboveground Biomass Estimation Based on Unmanned Aerial Vehicle–Light Detection and Ranging and Machine Learning

Comparison of machine-learning methods for above-ground biomass estimation based on Landsat imagery

Study on the Estimation of Forest Volume Based on Multi-Source Data

Total and component forest aboveground biomass inversion via LiDAR-derived features and machine learning algorithms

A Comparative Analysis of Remote Sensing Estimation of Aboveground Biomass in Boreal Forests Using Machine Learning Modeling and Environmental Data

Simultaneous Models for the Estimation of Main Forest Parameters Based on Airborne LiDAR Data

Quantifying the Effects of Stand and Climate Variables on Biomass of Larch Plantations Using Random Forests and National Forest Inventory Data in North and Northeast China