ISMRNN: An Implicitly Segmented RNN Method with Mamba for Long-Term Time Series Forecasting

GaoXiang Zhao, Li Zhou, XiaoQiang Wang
2024-08-04
Abstract: Long time series forecasting aims to utilize historical information to forecast future states over extended horizons. Traditional RNN-based series forecasting methods struggle to effectively address long-term dependencies and gradient issues in long time series problems. Recently, SegRNN has emerged as a leading RNN-based model tailored for long-term series forecasting, demonstrating state-of-the-art performance while maintaining a streamlined architecture through innovative segmentation and parallel decoding techniques. Nevertheless, SegRNN has several limitations: its fixed segmentation disrupts data continuity and fails to effectively leverage information across different segments, and its segmentation strategy does not fundamentally address the issue of information loss within the recurrent structure. To address these issues, we propose the ISMRNN method with three key enhancements. First, we introduce an implicit segmentation structure to decompose the time series and map it to segmented hidden states, resulting in denser information exchange during the segmentation phase. Second, we incorporate residual structures in the encoding layer to mitigate information loss within the recurrent structure. Third, we integrate the Mamba architecture to extract time series information more effectively. Experiments on several real-world long time series forecasting datasets demonstrate that our model surpasses the performance of current state-of-the-art models.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses several key problems in long-term time series forecasting, particularly the challenges that arise when recurrent neural networks (RNNs) are applied over long horizons. Specifically, it targets the following issues:

1. **Long-term Dependence and Gradient Problems**: Traditional RNN-based forecasting methods handle long-term dependencies poorly and are vulnerable to vanishing or exploding gradients. This makes it difficult for the model to capture trends and patterns over long time spans.
2. **Information Loss Caused by Segmentation Strategies**: The recently proposed SegRNN model improves long-term forecasting through segmentation and parallel decoding, but its fixed segmentation disrupts the continuity of the data and fails to use information across different segments effectively, resulting in information loss.
3. **Information Loss in Recurrent Structures**: SegRNN's segmentation strategy reduces the number of recurrent iterations, but information is still lost inside the recurrent structure, and the loss becomes more pronounced as the forecast horizon grows.

To solve these problems, the paper proposes the ISMRNN (Implicitly Segmented RNN with Mamba) method, which improves long-term time series forecasting through three enhancements:

- **Implicit Segmentation Structure**: An implicit segmentation structure decomposes the time series and maps it to segmented hidden states, giving denser information exchange and preserving data continuity.
- **Residual Structure**: A residual structure in the encoding layer lets some information bypass the recurrent structure, reducing information loss.
- **Mamba Architecture**: The Mamba architecture is integrated to enhance time series information extraction and better capture long-term dependencies.

These changes improve the model's performance on long-term time series forecasting tasks while maintaining low memory usage and training time. Experimental results show that ISMRNN outperforms existing state-of-the-art models on multiple real-world datasets.

In summary, the paper aims to remedy the deficiencies of existing RNN models in long-term time series forecasting by improving the segmentation strategy, reducing information loss, and strengthening information extraction, thereby improving forecasting accuracy and efficiency.
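The contrast between fixed and implicit segmentation, and the idea of a residual path around the recurrence, can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes implicit segmentation can be approximated by two dense maps from the whole input window to the segment states, and it assumes a skip path that simply averages the segment states. The names `fixed_segments`, `implicit_segments`, and `encode_with_residual`, the random weights, and all sizes are hypothetical. The Mamba block is omitted.

```python
import numpy as np

def fixed_segments(x, seg_len):
    """Fixed segmentation: cut the series into non-overlapping windows."""
    if len(x) % seg_len != 0:
        raise ValueError("series length must be divisible by seg_len")
    return x.reshape(-1, seg_len)            # (n_seg, seg_len)

def implicit_segments(x, w1, w2, n_seg):
    """Assumed stand-in for implicit segmentation: two dense maps, so every
    segment state can depend on every input time step."""
    hidden = np.tanh(x @ w1)                 # (d_mid,)
    return (hidden @ w2).reshape(n_seg, -1)  # (n_seg, d_model)

def encode_with_residual(states, w_in, w_h):
    """Plain tanh RNN over the segment states plus a skip path.
    The skip (mean of the states) is an assumption made for brevity."""
    h = np.zeros(w_h.shape[0])
    for s in states:
        h = np.tanh(s @ w_in + h @ w_h)
    return h + states.mean(axis=0)           # information bypassing the recurrence

# Hypothetical sizes and randomly initialised (untrained) weights.
rng = np.random.default_rng(0)
L, seg_len, d_model, d_mid = 24, 6, 8, 16
n_seg = L // seg_len
x = np.sin(np.arange(L) / 3.0)
w1 = rng.normal(size=(L, d_mid)) / np.sqrt(L)
w2 = rng.normal(size=(d_mid, n_seg * d_model)) / np.sqrt(d_mid)
w_in = rng.normal(size=(d_model, d_model)) / np.sqrt(d_model)
w_h = rng.normal(size=(d_model, d_model)) / np.sqrt(d_model)

# Perturb only the first time step and see which segments react.
x_bumped = x.copy()
x_bumped[0] += 1.0
fixed_changed = np.any(
    fixed_segments(x, seg_len) != fixed_segments(x_bumped, seg_len), axis=1)
implicit_changed = np.any(
    implicit_segments(x, w1, w2, n_seg)
    != implicit_segments(x_bumped, w1, w2, n_seg), axis=1)

print("fixed segments affected:   ", fixed_changed)
print("implicit segments affected:", implicit_changed)
print("encoded state shape:", encode_with_residual(
    implicit_segments(x, w1, w2, n_seg), w_in, w_h).shape)
```

With fixed segmentation, a change at one time step stays inside its own window. With the dense mapping, it can reach every segment state, which is one way to read the "denser information exchange" the summary describes.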