Interpretable spatio-temporal attention LSTM model for flood forecasting

Yukai Ding,Yuelong Zhu,Jun Feng,Pengcheng Zhang,Zirun Cheng
DOI: https://doi.org/10.1016/j.neucom.2020.04.110
IF: 6
2020-08-01
Neurocomputing
Abstract:Modeling interpretable artificial intelligence (AI) for flood forecasting represents a serious challenge: both accuracy and interpretability are indispensable. Because of the uncertainty and nonlinearity of flood, existing hydrological solutions always achieve low prediction robustness while machine learning (ML) approaches neglect the physical interpretability of models. In this paper, we focus on the need for flood forecasting and propose an interpretable Spatio-Temporal Attention Long Short Term Memory model (STA-LSTM) based on LSTM and attention mechanism. We use dynamic attention mechanism and LSTM to build model, Max-Min method to normalize data, variable control method to select hyperparameters, and Adam algorithm to train the model. Emphasis is placed on the visualization and interpretation of attention weights. Experiment results on three small and medium basins in China suggest that the proposed STA-LSTM model outperforms Historical Average (HA), Fully Connected Network (FCN), Convolutional Neural Networks (CNN), Graph Convolutional Networks (GCN), original LSTM (LSTM), spatial attention LSTM (SA-LSTM), and temporal attention LSTM (TA-LSTM) in most cases. Visualization and interpretation of spatial and temporal attention weights reflect the reasonability of the proposed attention-based model.
computer science, artificial intelligence
What problem does this paper attempt to address?