Interpretable System Identification and Long-term Prediction on Time-Series Data

Xiaoyi Liu,Duxin Chen,Wenjia Wei,Xia Zhu,Wenwu Yu
2023-03-02
Abstract:Time-series prediction has drawn considerable attention during the past decades fueled by the emerging advances of deep learning methods. However, most neural network based methods lack interpretability and fail in extracting the hidden mechanism of the targeted physical system. To overcome these shortcomings, an interpretable sparse system identification method without any prior knowledge is proposed in this study. This method adopts the Fourier transform to reduces the irrelevant items in the dictionary matrix, instead of indiscriminate usage of polynomial functions in most system identification methods. It shows an interpretable system representation and greatly reduces computing cost. With the adoption of $l_1$ norm in regularizing the parameter matrix, a sparse description of the system model can be achieved. Moreover, Three data sets including the water conservancy data, global temperature data and financial data are used to test the performance of the proposed method. Although no prior knowledge was known about the physical background, experimental results show that our method can achieve long-term prediction regardless of the noise and incompleteness in the original data more accurately than the widely-used baseline data-driven methods. This study may provide some insight into time-series prediction investigations, and suggests that an white-box system identification method may extract the easily overlooked yet inherent periodical features and may beat neural-network based black-box methods on long-term prediction tasks.
Machine Learning,Systems and Control,Numerical Analysis
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve several key challenges in time - series prediction, especially the problem of long - term stable prediction. Specifically: 1. **Lack of interpretability**: Most neural - network - based methods, while performing well in time - series prediction, lack the ability to explain the hidden mechanisms of the target physical system. This makes these methods difficult to be applied to large - scale industrial systems that require reliability and interpretability. 2. **Inability to extract intrinsic periodic features**: Most existing system identification methods rely on polynomial functions to construct a dictionary matrix. This method may overlook the potential periodic features in the data, thus affecting the accuracy of prediction. 3. **High computational cost**: Traditional system identification methods usually involve a large number of basic functions (such as polynomials, trigonometric functions, etc.), resulting in high computational complexity and cost, especially when dealing with large - scale data. 4. **Poor stability in long - term prediction**: For long - term prediction tasks (for example, prediction of more than 1,000 steps), existing methods often have difficulty maintaining the accuracy and stability of prediction due to insufficient understanding of the internal mechanisms of the system. To solve these problems, the paper proposes a new interpretable sparse system identification method (SIABF), which improves the existing technology in the following ways: - **Using the discrete Fourier transform (DFT)**: By using DFT to transform the time - series into the frequency domain, the number of irrelevant items is reduced and the adaptive basis functions are automatically selected. This not only improves the interpretability of the model but also significantly reduces the computational cost. - **Introducing L1 regularization**: Using the L1 norm to regularize the parameter matrix, a sparse description of the system is achieved, thereby improving the accuracy and stability of prediction. - **No prior knowledge required**: This method can automatically extract useful periodic features from the data without any prior knowledge about the physical background, and is applicable to various types of time - series data (such as water conservancy data, global temperature data, and financial data). In summary, the main purpose of the paper is to develop a new method that can achieve long - term stable prediction and is highly interpretable, in order to overcome the limitations of existing time - series prediction methods.