Comparative evaluation of statistical and machine learning models for weather-driven wheat yield forecasting across different districts of Punjab
Kulwinder Kaur Gill,Kavita Bhatt,Akansha,Parul Setiya,Sandeep Singh Sandhu,Baljeet Kaur
DOI: https://doi.org/10.1007/s12517-024-12077-1
2024-09-22
Arabian Journal of Geosciences
Abstract:Predicting crop yields before harvest is important for making and carrying out policies about food safety, transportation costs, import-export, storage, and selling of agricultural goods. The weather is a key factor in crop growth and its development. Therefore, models that include meteorological variables can predict reliable forecasts for crop output; however, selecting the appropriate model for use in agricultural production forecasting can be challenging. This study investigates the development of wheat yield prediction models using various multivariate analysis techniques and weather indices derived from meteorological data collected over 22 years in Punjab, India. Five different modeling approaches, including stepwise multiple linear regression (SMLR), LASSO, elastic net (ELNET), artificial neural network (ANN), and ridge regression, were employed and compared for their effectiveness in predicting wheat yield. The models were calibrated using data from 17 years (2000–01 to 2016–17) and validated using data from the subsequent 5 years (2017–18 to 2021–22). Evaluation metrics such as R 2 , root mean square error (RMSE), normalized root mean square error (NRMSE), mean biased error (MBE), and modeling efficiency (EF) were utilized to assess model performance. The results indicate varying degrees of performance across districts and modeling techniques. ANN demonstrated the highest performance during both calibration and validation periods, followed closely by LASSO and ELNET. However, certain districts showed discrepancies in model fit, with some models performing better than others depending on the specific district. Overall, ANN emerged as the most reliable approach for wheat yield prediction in Punjab followed by ELNET and LASSO, offering valuable insights for agricultural planning and management. This comprehensive analysis provides valuable contributions to the field of crop yield prediction, enhancing understanding of the complex interactions between weather variables and agricultural outcomes.
geosciences, multidisciplinary