Forecasting large collections of time series: feature-based methods

Li Li,Feng Li,Yanfei Kang
2023-09-25
Abstract:In economics and many other forecasting domains, the real world problems are too complex for a single model that assumes a specific data generation process. The forecasting performance of different methods changes depending on the nature of the time series. When forecasting large collections of time series, two lines of approaches have been developed using time series features, namely feature-based model selection and feature-based model combination. This chapter discusses the state-of-the-art feature-based methods, with reference to open-source software implementations.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in economic and other forecasting fields, a single model is difficult to deal with complex real - world problems, because the forecasting performance of different methods will change with the change of time - series properties. When a large number of time series need to be forecasted, it becomes especially important to use time - series features to select the most appropriate model or determine the best combination of candidate models. Specifically, the paper explores the latest progress of feature - based methods in forecasting a large number of time series and refers to open - source software implementations. The paper mainly focuses on how to use the features of time series for model selection or model combination to improve the accuracy of forecasting. The paper mentions that due to the "No - Free - Lunch theorem", no model can perform best for all time series. Therefore, for the forecasting of a large number of time series, instead of choosing one model to be applied to all data, the most appropriate model is selected or the optimal model combination is determined according to the features of each time series. This process can be automated through meta - learning, that is, by training a meta - learner to learn the relationship between individual forecasts and forecasting performance, so as to determine the best forecasting method or combination weights. The paper also reviews relevant research from the 1970s to the present, including the algorithm selection framework proposed by Rice (1976), 99 rules developed by Collopy and Armstrong (1992) based on 18 features, Shah (1997) using multiple features for time - series classification and predicting the best model through discriminant analysis, etc. In addition, the paper discusses automated methods of feature extraction, forecasting pruning, and solutions to some practical forecasting problems, such as intermittent demand and uncertainty estimation. In general, this paper aims to provide a systematic methodology to meet the challenges in forecasting a large number of time series by reviewing and analyzing existing feature - based forecasting methods, especially how to effectively use the features of time series to improve the accuracy and robustness of forecasting.