Predictability Analysis and Prediction of Discrete Weather and Financial Time-Series Data with a Hamiltonian-Based Filter-Projection Approach

Henrik Kiefer,Denis Furtel,Cihan Ayaz,Anton Klimek,Jan O. Daldrop,Roland R. Netz
2024-09-24
Abstract:The generalized Langevin equation (GLE), derived by projection from a general many-body Hamiltonian, exactly describes the dynamics of an arbitrary coarse-grained variable in a complex environment. However, analysis and prediction of real-world data with the GLE is hampered by slow transient or seasonal data components and time-discretization effects. Machine-learning (ML) techniques work but are computer-resource demanding and difficult to interpret. We show that by convolution filtering, time-series data decompose into fast, transient and seasonal components that each obey Hamiltonian dynamics and, thus, can be separately analyzed by projection techniques. We introduce methods to extract all GLE parameters from highly discretized time-series data and to forecast future data including the environmental stochasticity. For daily-resolved weather data, our analysis reveals non-Markovian memory that decays over a few days. Our prediction accuracy is comparable to commercial (<a class="link-external link-http" href="http://weather.com" rel="external noopener nofollow">this http URL</a>) and ML long short-term memory (LSTM) methods at a reduced computational cost by a factor of $10^2-10^3$ compared to LSTM. For financial data, memory is very short-ranged and the dynamics effectively is Markovian, in agreement with the efficient-market hypothesis; consequently, models simpler than the GLE are sufficient. Our GLE framework is an efficient and interpretable method for the analysis and prediction of complex time-series data.
Data Analysis, Statistics and Probability
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to use the Generalized Langevin Equation (GLE) to analyze and predict discrete - time - series data, especially weather and financial data. Specifically, the paper focuses on the following aspects: 1. **Multiscale Dynamic Coupling**: Solve the multiscale dynamic coupling problem between short - time stochastic effects and long - time transient or seasonal effects. 2. **Non - Gaussian Correlation**: Deal with the non - Gaussian correlations existing in many non - trivial systems. 3. **Time - Discretization Effect**: Overcome the influence of time - discretization on the application of GLE. 4. **Non - equilibrium Effect**: Handle the slowly relaxing non - equilibrium effects, which make many tools in statistical mechanics ineffective. To address these problems, the author proposes a method that combines convolution filtering and projection techniques to decompose time - series data into rapidly changing, transient and seasonal components, and analyze them with Hamiltonian dynamics respectively. This method can not only extract all GLE parameters, but also predict future data, including the influence of environmental randomness. ### Main Contributions - **Efficiency**: Compared with traditional machine - learning methods (such as LSTM), this method significantly reduces the demand for computational resources while maintaining high prediction accuracy. - **Interpretability**: All model parameters are interpretable, which makes this method more advantageous in natural - science research and high - risk social - political decision - making. - **Scope of Application**: This method is applicable to the analysis and prediction of various complex time - series data, including weather and financial data. ### Application Examples - **Weather Data**: By analyzing the daily maximum temperature data in Berlin through GLE, the non - Markovian memory effect is revealed, and the accuracy and efficiency of this method in weather prediction are demonstrated. - **Financial Data**: By analyzing Bitcoin price, S&P 500 index and Yen / US Dollar exchange rate data through GLE, it is found that the memory effect of financial data is very short, which is in line with the efficient - market hypothesis. In conclusion, this paper provides an efficient and interpretable method to analyze and predict complex time - series data by introducing a new GLE framework, especially outstanding in the applications in the weather and financial fields.