FDNet: Focal Decomposed Network for Efficient, Robust and Practical Time Series Forecasting

Li Shen, Yuning Wei, Yangzhu Wang, Huaxin Qiu
2023-06-19
Abstract: This paper presents FDNet: a Focal Decomposed Network for efficient, robust and practical time series forecasting. We break away from conventional deep time series forecasting formulas, which obtain prediction results from universal feature maps of input sequences. In contrast, FDNet neglects universal correlations of input elements and only extracts fine-grained local features from the input sequence. We show that: (1) Deep time series forecasting with only fine-grained local feature maps of the input sequence is feasible on a theoretical basis. (2) By abandoning global coarse-grained feature maps, FDNet overcomes the distribution shift problem caused by the changing dynamics of time series, which is common in real-world applications. (3) FDNet does not depend on any inductive bias of time series except basic auto-regression, making it general and practical. Moreover, we propose a focal input sequence decomposition method, which decomposes the input sequence in a focal manner for efficient and robust forecasting when facing the Long Sequence Time series Input (LSTI) problem. FDNet achieves competitive forecasting performance on six real-world benchmarks and reduces prediction MSE by 38.4% on average compared with thirteen other SOTA baselines. The source code is available at https://github.com/OrigamiSL/FDNet.
Machine Learning, Artificial Intelligence, Information Theory
What problem does this paper attempt to address?
This paper attempts to solve two main problems:

1. **The necessity of the Input-Correlation-Oriented Mechanism (ICOM) in time-series forecasting**:
   - Conventional deep time-series forecasting models rely on mechanisms that extract global features from input sequences (such as self-attention and dilated convolution), which aim to capture universal correlations among input elements. The authors argue that this global feature extraction is not required and can even cause problems on real-world time series, such as distribution shift. Specifically, because real-world time series are often non-stationary and the statistical properties or dynamics of their local windows vary widely, it is difficult to extract a universal representation that fits all local windows. This makes ICOM-based networks prone to local optima and leaves their prediction performance sensitive to random weight initialization.
2. **The Long Sequence Time series Input (LSTI) problem**:
   - Time-series forecasting with long input sequences suffers from performance degradation, over-fitting, and parameter explosion. Although some existing methods (such as Time-Series Forecasting Transformers, TSFT) claim to capture long-term dependencies, their performance still degrades when the input sequence is too long. To address this, the paper proposes the focal input sequence decomposition method. It decomposes the input sequence into multiple sub-sequences and assigns different feature extraction layers to each according to the temporal distance between the sub-sequence and the predicted elements. According to the paper, this both improves accuracy and significantly reduces the number of parameters, making the model more efficient and robust on extremely long input sequences.

### Solution overview

To address the above problems, the paper proposes FDNet (Focal Decomposed Network), which has the following characteristics:

- **Decomposed prediction formula**: FDNet abandons conventional global feature extraction and only extracts fine-grained local features from the input sequence. In this way, FDNet avoids the distribution shift problem caused by the changing dynamics of time series.
- **Focal input sequence decomposition**: This method decomposes the input sequence into multiple sub-sequences according to temporal distance. Sub-sequences closer to the predicted elements are shorter and pass through more feature extraction layers. As a result, the connection between input and predicted elements becomes weaker and shallower as temporal distance increases, and parameter explosion is avoided.

### Main contributions

1. A novel decomposed prediction formula is proposed, which decomposes not only the prediction process of the predicted elements but also the feature extraction process of the input elements.
2. FDNet, a simple and effective network architecture built on the decomposed prediction formula, is constructed from basic linear projection layers and CNNs.
3. The focal input sequence decomposition method is proposed, which addresses the LSTI problem and keeps the model efficient and robust on extremely long input sequences.
4. Extensive experiments on six benchmark datasets show that FDNet outperforms 13 other state-of-the-art methods, reducing the average prediction MSE by 36.2% on multivariate tasks and by 40.5% on univariate tasks.
5. Ablation studies show that the focal input sequence decomposition method is competitive in dealing with the LSTI problem and can be combined with other prediction formulas.

Through these innovations, FDNet shows on a theoretical basis that time-series forecasting without ICOM is feasible, and it also demonstrates strong performance and robustness in practical applications.
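
The focal input sequence decomposition described above can be sketched in a few lines of Python. This is a minimal illustration and not the authors' implementation: the function name `focal_split`, the halving ratio, and the rule that layer depth equals a segment's position are all assumptions made for this example. The summary only states that sub-sequences closer to the predicted elements are shorter and receive more feature extraction layers.

```python
def focal_split(seq_len, num_parts):
    """Split indices [0, seq_len) into contiguous segments, oldest first.

    Returns a list of (start, end, depth) tuples with end exclusive.
    Assumptions for illustration only: each segment takes about half of
    the remaining steps, and depth (number of feature extraction layers)
    grows by one per segment toward the forecast point.
    """
    if num_parts < 1 or seq_len < num_parts:
        raise ValueError("need 1 <= num_parts <= seq_len")

    segments = []
    start = 0
    for i in range(num_parts):
        remaining = seq_len - start
        parts_left = num_parts - i
        if parts_left == 1:
            # The newest segment takes whatever is left.
            length = remaining
        else:
            # Take half of the remainder (rounded up), but leave at
            # least one time step for every later segment.
            length = min((remaining + 1) // 2, remaining - (parts_left - 1))
        depth = i + 1  # hypothetical rule: newer segments are deeper
        segments.append((start, start + length, depth))
        start += length
    return segments


if __name__ == "__main__":
    for start, end, depth in focal_split(96, 4):
        print(f"steps [{start}, {end}) length={end - start} layers={depth}")
```

With an input length of 96 and four parts, the segment lengths are 48, 24, 12, and 12, so the oldest half of the input goes through a single layer while the most recent 12 steps go through four. Because the far, long segments are processed shallowly, the layer count does not grow with the full input length, which is the intuition behind the parameter savings the paper reports.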