ForecastPFN: Synthetically-Trained Zero-Shot Forecasting

Samuel Dooley,Gurnoor Singh Khurana,Chirag Mohapatra,Siddartha Naidu,Colin White
2023-11-03
Abstract:The vast majority of time-series forecasting approaches require a substantial training dataset. However, many real-life forecasting applications have very little initial observations, sometimes just 40 or fewer. Thus, the applicability of most forecasting methods is restricted in data-sparse commercial applications. While there is recent work in the setting of very limited initial data (so-called `zero-shot' forecasting), its performance is inconsistent depending on the data used for pretraining. In this work, we take a different approach and devise ForecastPFN, the first zero-shot forecasting model trained purely on a novel synthetic data distribution. ForecastPFN is a prior-data fitted network, trained to approximate Bayesian inference, which can make predictions on a new time series dataset in a single forward pass. Through extensive experiments, we show that zero-shot predictions made by ForecastPFN are more accurate and faster compared to state-of-the-art forecasting methods, even when the other methods are allowed to train on hundreds of additional in-distribution data points.
Machine Learning
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve a key challenge in time - series prediction: **zero - shot prediction**. Specifically, most existing time - series prediction methods require a large number of initial data points for training to learn patterns and make future predictions. However, in many practical applications, especially in business scenarios, the available initial observation data is very limited, sometimes with only 40 or fewer data points. In such cases, traditional prediction methods are difficult to apply. #### Main problems: 1. **Data scarcity problem**: In many real - world time - series prediction tasks, the initial observation data is very limited, usually only a few dozen data points. 2. **Limitations of existing methods**: Existing zero - shot prediction methods (such as Meta - N - BEATS) have unstable performance on different pre - training data sets and rely on specific data distributions. 3. **Computational efficiency problem**: Traditional methods and deep - learning methods usually require a long training time when dealing with large - scale time - series. ### Solutions: To solve these problems, the authors propose a brand - new zero - shot prediction model - **ForecastPFN**. ForecastPFN is a prior - data fitted network (PFN) trained on synthetic data, which can quickly and accurately predict new time - series without any training on the target data set. #### Main innovation points: 1. **Pure synthetic data training**: ForecastPFN is trained entirely with synthetic data, avoiding dependence on real - world data and ensuring the generalization ability of the model. 2. **Efficient zero - shot prediction**: Through one - time offline training, ForecastPFN can predict new time - series in a single forward propagation, greatly increasing the prediction speed. 3. **Multi - scale time - series modeling**: The synthetic data generation model takes into account multi - scale seasonal trends, global trends and noise, enabling the model to capture various complex time - series features. ### Experimental results: Through extensive experiments, the authors demonstrate the superior performance of ForecastPFN on multiple real - world data sets. Even when other methods are allowed to be trained on hundreds of additional data points with the same distribution, the zero - shot prediction of ForecastPFN is still more accurate and faster. ### Summary: The main contribution of this paper is the proposal of ForecastPFN, a brand - new zero - shot time - series prediction model that can provide efficient and accurate predictions in the case of data scarcity. This method not only solves the performance instability problem of existing methods in the case of limited data, but also significantly improves the prediction speed and efficiency.