SALSTM: Segmented Self-Attention Long Short-Term Memory for Long-Term Forecasting

Zhi-Qiang Dai,Jie Li,Yang-Jie Cao,Yong-Xiang Zhang
DOI: https://doi.org/10.1007/s11227-024-06493-z
2025-01-01
Abstract:Time series forecasting plays a crucial role in various fields such as financial market analysis, weather prediction, and traffic flow forecasting. Although long short-term memory (LSTM) performs well in traditional time series forecasting tasks, its performance significantly deteriorates as the prediction sequence length increases. When the prediction length reaches 96, the results are almost irrelevant to the actual task. LSTM networks often struggle with performance degradation as the prediction sequence length increases in long-term time series forecasting (LTSF). To address this issue, we propose an innovative time series forecasting model called segmented self-attention long short-term memory (SALSTM), which combines segmented iteration and intra-segment self-attention mechanisms to tackle the performance degradation of traditional LSTM in LTSF. By reducing the number of recursive iterations and enhancing the ability to capture long-distance dependencies, SALSTM significantly improves prediction accuracy and computational efficiency. Experimental results show that SALSTM outperforms traditional LSTM and current state-of-the-art (SOTA) transformer models across multiple benchmark datasets. The research indicates that LSTM variants still hold potential in LTSF, and thoughtful design can further enhance their effectiveness.
What problem does this paper attempt to address?