Quantifying regional variability of machine-learning-based snow water equivalent estimates across the Western United States

Dane Liljestrand,Ryan Johnson,S. McKenzie Skiles,Steven Burian,Josh Christensen
DOI: https://doi.org/10.1016/j.envsoft.2024.106053
IF: 5.471
2024-04-22
Environmental Modelling & Software
Abstract:Seasonal snow-derived water is a critical component of the water supply in the mountains and downstream regions, and the accurate characterization of available water in the form of snow-water-equivalent (SWE), peak SWE, and snowmelt onset are essential inputs for water management efforts. Arising from recent advancements in artificial intelligence (AI) and machine learning (ML), we introduce a large-scale ML SWE model leveraging publicly available data sources and open-source software. The model demonstrates the application of a limited feature space in a relatively simple ML architecture without the need for process-based formulations to effectively estimate spatially continuous SWE at a daily temporal resolution. Beginning with in situ SWE measurements (i.e., SNOTEL), lidar-derived terrain features, and temporal variables, we employ localized feature engineering and optimization via gradient-boosting decision trees to identify regionally unique drivers of snowpack dynamics and use the optimal features to train regionally independent artificial neural networks to estimate regional SWE at a 1-km spatial resolution. The model results yield respectable skill in reconstructed 1-km gridded SWE magnitudes in a hindcast simulation of the 2019 water year that is independent of the training and testing data. Comparing model estimates to over 6200 observations, the model demonstrates a weighted RMSE of 15.4-cm, Kling-Gupta Efficiency metric of 0.86, and a percent bias of 0.71% across 23 snow-influenced regions in the western U.S. The model simulation produces peak SWE estimates within 10-cm for twenty of the twenty-three regions, demonstrating capability in effectively capturing regional snow accumulation processes. The demonstration of low-error ML workflows capable of providing near-real-time, spatially continuous SWE estimates at a high spatial resolution provides proof-of-concept and a foundation to effectively update snow state variables that drive water supply forecasts in snow-dominated regions.
environmental sciences,engineering, environmental,water resources,computer science, interdisciplinary applications
What problem does this paper attempt to address?