Correlational graph attention-based Long Short-Term Memory network for multivariate time series prediction

Shuang Han,Hongbin Dong,Xuyang Teng,Xiaohui Li,Xiaowei Wang
DOI: https://doi.org/10.1016/j.asoc.2021.107377
IF: 8.7
2021-07-01
Applied Soft Computing
Abstract:<p>Multi-variate time series prediction models use the historical information of multiple exogenous series to predict the future values of the target series. At present, attention-based deep networks can obtain the spatial correlations between target series and multiple exogenous series, but it is difficult to capture temporal correlations across multiple time steps, which play a role in improving prediction accuracy. To that end, we propose a correlational graph attention-based Long Short-Term Memory network (CGA-LSTM), a nested network that nests the correlational attention mechanism in the graph attention mechanism to strengthen the spatio-temporal correlations. We construct the time series as a graph structure, where nodes represent time steps in exogenous series. To obtain sufficient expressive power, we propose a nonlinear transformation, correlational attention-based LSTM, instead of the original linear transformation to transform the exogenous series into higher-level features. The original linear transformations cannot obtain spatial correlations. The correlational attention mechanism can adaptively select the relevant exogenous series to obtain the spatial correlations. Then calculating the weight coefficients between the node and its neighbors to capture the temporal correlations. The performance of the proposed algorithm was tested on 4 datasets and compared with state-of-the-art methods. The experimental results show that our model is effective, can provide higher prediction accuracy.</p>
computer science, artificial intelligence, interdisciplinary applications
What problem does this paper attempt to address?
The paper attempts to address two main issues in multivariate time series forecasting: 1. **Spatial Correlation**: How to capture highly dynamic, time-varying spatial correlations between the target sequence and multiple exogenous sequences. 2. **Temporal Correlation**: How to capture temporal correlations at different time steps. Specifically, multivariate time series data include the values of the target sequence and multiple exogenous sequences, which are widely present in scenarios such as stock data in financial markets, photovoltaic power data from power companies, and air quality and water quality data from urban sensors. The forecasting task is very challenging, mainly influenced by the following two factors: - **Spatial Correlation**: The spatial correlation between the target sequence and multiple exogenous sequences is highly dynamic and changes over time. - **Temporal Correlation**: Temporal correlations at different time steps play an important role in improving forecasting accuracy. For example, solar radiation, humidity, and temperature affect photovoltaic power generation. At night, solar radiation is zero, and no power is generated; as the sun rises, photovoltaic power first increases and then decreases, but sometimes the trend of photovoltaic power differs from solar radiation due to the influence of temperature and humidity. The importance of each exogenous sequence to the target sequence is also different, and most time series data are collected by sensors, which are easily affected by the surrounding environment, so detecting useful exogenous sequences is crucial for forecasting. Traditional Autoregressive Integrated Moving Average (ARIMA) models and Vector Autoregression (VAR) models have limitations in handling multivariate time series data and cannot effectively model spatiotemporal correlations. Therefore, the authors propose a Correlation Graph Attention-based Long Short-Term Memory network (CGA-LSTM), aiming to enhance the ability to capture spatiotemporal correlations through a nested correlation attention mechanism.