Model ensembles of artificial neural networks and support vector regression for improved accuracy in the prediction of vegetation conditions

Chrisgone Adede,Robert Oboko,Peter W. Wagacha,Clement Atzberger
DOI: https://doi.org/10.48550/arXiv.1908.10104
2019-08-27
Abstract:There is increasing need for highly predictive and stable models for the prediction of drought as an aid to better planning for drought response. This paper presents the performance of both homogenous and heterogenous model ensembles in the prediction of drought severity using the study case techniques of artificial neural networks (ANN) and support vector regression (SVR). For each of the homogenous and heterogenous model ensembles, the study investigates the performance of three model ensembling approaches: linear averaging (non-weighted), ranked weighted averaging and model stacking using artificial neural networks. Using the approach of 'over-produce then select', the study used 17 years of data on 16 selected variables for predictive drought monitoring to build 244 individual ANN and SVR models from which 111 models were selected for the building of the model ensembles. The results indicate marginal superiority of heterogenous to homogenous model ensembles. Model stacking is shown to realize models that are superior in performance in the prediction of future vegetation conditions as compared to the linear averaging and weighted averaging approaches. The best performance from the heterogenous stacked model ensembles recorded an R2 of 0.94 in the prediction of future vegetation conditions as compared to an R2 of 0.83 and R2 of 0.78 for both ANN and SVR respectively in the traditional champion model approaches to the realization of predictive models. We conclude that despite the computational resource intensiveness of the model ensembling approach to drought prediction, the returns in terms of model performance is worth the investment, especially in the context of the recent exponential increase in computational power.
Applications
What problem does this paper attempt to address?
The paper aims to address the issues of accuracy and stability in drought prediction. Specifically, the study seeks to improve the accuracy of future vegetation condition predictions by constructing homogeneous and heterogeneous model ensembles of Artificial Neural Networks (ANN) and Support Vector Regression (SVR). The main objectives of the paper include: 1. **Comparing the performance of homogeneous and heterogeneous model ensembles**: The study compares the performance of homogeneous model ensembles based on ANN and SVR with heterogeneous model ensembles in predicting drought severity. 2. **Exploring different ensemble methods**: The research examines the effectiveness of three ensemble methods: linear averaging (non-weighted), weighted averaging, and model stacking based on Artificial Neural Networks. 3. **Optimizing model selection**: The study adopts a "produce excessively and then select" strategy, selecting 111 models from 17 years of data to construct the model ensembles, and explores how to choose the best ensemble members. The final results show that heterogeneous model stacking performs the best in predicting future vegetation conditions, offering higher prediction accuracy compared to traditional single-model methods. Although model ensemble methods require significant computational resources, the performance improvement they bring is considered worthwhile, especially in the context of rapidly increasing computational capabilities.