Global Crop-Specific Fertilization Dataset from 1961-2019

Fernando Coello,Thomas Decorte,Iris Janssens,Steven Mortier,Jordi Sardans,Josep Peñuelas,Tim Verdonck
2024-06-14
Abstract:As global fertilizer application rates increase, high-quality datasets are paramount for comprehensive analyses to support informed decision-making and policy formulation in crucial areas such as food security or climate change. This study aims to fill existing data gaps by employing two machine learning models, eXtreme Gradient Boosting and HistGradientBoosting algorithms to produce precise country-level predictions of nitrogen ($N$), phosphorus pentoxide ($P_2O_5$), and potassium oxide ($K_2O$) application rates. Subsequently, we created a comprehensive dataset of 5-arcmin resolution maps depicting the application rates of each fertilizer for 13 major crop groups from 1961 to 2019. The predictions were validated by both comparing with existing databases and by assessing the drivers of fertilizer application rates using the model's SHapley Additive exPlanations. This extensive dataset is poised to be a valuable resource for assessing fertilization trends, identifying the socioeconomic, agricultural, and environmental drivers of fertilizer application rates, and serving as an input for various applications, including environmental modeling, causal analysis, fertilizer price predictions, and forecasting.
Computational Engineering, Finance, and Science
What problem does this paper attempt to address?
The paper aims to address the issue of insufficient data on specific fertilization for global crops. Specifically, the study employs two machine learning models (eXtreme Gradient Boosting and HistGradientBoosting) to generate accurate national-level predictions of nitrogen (N), phosphorus pentoxide (P2O5), and potassium oxide (K2O) application rates. Subsequently, based on these predictions, a high-resolution (5 arc minutes) atlas was created, covering the fertilization rates of 13 major crop groups from 1961 to 2019. The study not only validates the predictions by comparing them with existing databases but also uses the models' SHapley Additive exPlanations (SHAP) values to assess the drivers of fertilization rates. This comprehensive dataset is of significant value for evaluating fertilization trends, identifying socio-economic and environmental drivers of fertilization rates, and serving as input for various applications such as environmental modeling, causal analysis, fertilizer price prediction, and forecasting. In summary, the paper aims to fill existing data gaps and provide a comprehensive global crop-specific fertilization dataset to support decision-making and policy formation in critical areas such as food security and climate change.