Leveraging Non-Decimated Wavelet Packet Features and Transformer Models for Time Series Forecasting

Guy P Nason,James L. Wei
2024-03-13
Abstract:This article combines wavelet analysis techniques with machine learning methods for univariate time series forecasting, focusing on three main contributions. Firstly, we consider the use of Daubechies wavelets with different numbers of vanishing moments as input features to both non-temporal and temporal forecasting methods, by selecting these numbers during the cross-validation phase. Secondly, we compare the use of both the non-decimated wavelet transform and the non-decimated wavelet packet transform for computing these features, the latter providing a much larger set of potentially useful coefficient vectors. The wavelet coefficients are computed using a shifted version of the typical pyramidal algorithm to ensure no leakage of future information into these inputs. Thirdly, we evaluate the use of these wavelet features on a significantly wider set of forecasting methods than previous studies, including both temporal and non-temporal models, and both statistical and deep learning-based methods. The latter include state-of-the-art transformer-based neural network architectures. Our experiments suggest significant benefit in replacing higher-order lagged features with wavelet features across all examined non-temporal methods for one-step-forward forecasting, and modest benefit when used as inputs for temporal deep learning-based models for long-horizon forecasting.
Methodology,Machine Learning
What problem does this paper attempt to address?
This paper explores how to improve the accuracy of univariate time series forecasting by combining wavelet analysis techniques with machine learning methods. Specifically, it investigates the effects of using Daubechies wavelets with different vanishing moment numbers as input features, and compares the performance of non-decimated wavelet transform and non-decimated wavelet packet transform in computing these features. The paper also evaluates the advantages of applying these wavelet features to a broader range of time series forecasting methods, including statistical and deep learning approaches, especially for short-term and long-term predictions. The experiments demonstrate that using wavelet features can significantly improve the one-step forecasting performance of non-time series methods and provide moderate benefits for long-term predictions of deep learning-based temporal models.