A decoder-only foundation model for time-series forecasting

Abhimanyu Das,Weihao Kong,Rajat Sen,Yichen Zhou
2024-04-18
Abstract:Motivated by recent advances in large language models for Natural Language Processing (NLP), we design a time-series foundation model for forecasting whose out-of-the-box zero-shot performance on a variety of public datasets comes close to the accuracy of state-of-the-art supervised forecasting models for each individual dataset. Our model is based on pretraining a patched-decoder style attention model on a large time-series corpus, and can work well across different forecasting history lengths, prediction lengths and temporal granularities.
Computation and Language,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of designing a decoder-only foundational model (TimesFM) for time series forecasting, which can achieve near state-of-the-art zero-shot prediction performance on unseen datasets. Specifically, the model is pre-trained on a large-scale time series corpus, including real-world data (such as Google Trends, Wikipedia page view statistics) and synthetic data. Experiments show that the model can achieve accurate zero-shot predictions across different domains, forecast horizons, and time granularities, thereby significantly reducing the additional training burden and computational requirements for downstream forecasting tasks. Compared to existing large language models, this model achieves better zero-shot performance with a much smaller scale.