Forest canopy height modelling based on photogrammetric data and machine learning methods

Xingsheng Deng,Yujing Liu,Xingdong Cheng
DOI: https://doi.org/10.1111/phor.12507
2024-06-06
The Photogrammetric Record
Abstract:The study employs three machine learning techniques, namely gradient boosting decision tree regression, random forest regression and support vector machine regression, to construct high‐resolution canopy height models in forested areas. These models are based on spectral feature factors extracted from digital orthophoto maps and geometric feature factors derived from digital surface models. Experimental results demonstrate the potential of the canopy height models constructed by the gradient boosting decision tree regression to achieve prediction accuracies of 0.2 m in areas with 50% canopy coverage and 0.6 m in areas with 99% canopy coverage, even when only utilising a subset 20% of the available data sets for model training purposes. Forest topographic survey is a problem that photogrammetry has not solved for a long time. Forest canopy height is a crucial forest biophysical parameter which is used to derive essential information about forest ecosystems. In order to construct a canopy height model in forest areas, this study extracts spectral feature factors from digital orthophoto map and geometric feature factors from digital surface model, which are generated through aerial photogrammetry and LiDAR (light detection and ranging). The maximum information coefficient, Pearson, Kendall, Spearman correlation coefficients, and a new proposed index of relative importance are employed to assess the correlation between each feature factor and forest vertical heights. Gradient boosting decision tree regression is introduced and utilised to construct a canopy height model, which enables the prediction of unknown canopy height in forest areas. Two additional machine learning techniques, namely random forest regression and support vector machine regression, are employed to construct canopy height model for comparative analysis. The data sets from two study areas have been processed for model training and prediction, yielding encouraging experimental results that demonstrate the potential of canopy height model to achieve prediction accuracies of 0.3 m in forested areas with 50% vegetation coverage and 0.8 m in areas with 99% vegetation coverage, even when only a mere 10% of the available data sets are selected as model training data. The above approaches present techniques for modelling canopy height in forested areas with varying conditions, which have been shown to be both feasible and reliable.
geosciences, multidisciplinary,geography, physical,remote sensing,imaging science & photographic technology
What problem does this paper attempt to address?