Entanglement-Embedded Recurrent Network Architecture: Tensorized Latent State Propagation and Chaos Forecasting

Xiangyi Meng, Tong Yang
DOI: https://doi.org/10.48550/arXiv.2006.14698
2020-06-11
Abstract: Chaotic time series forecasting remains far less understood than its tremendous potential in theory and real-world applications would warrant. Traditional statistical/ML methods are inefficient at capturing chaos in nonlinear dynamical systems, especially when the time difference $\Delta t$ between consecutive steps is so large that a trivial, ergodic local minimum would most likely be reached instead. Here, we introduce a new long-short-term-memory (LSTM)-based recurrent architecture by tensorizing the cell-state-to-state propagation therein, keeping the long-term memory feature of LSTM while simultaneously enhancing the learning of short-term nonlinear complexity. We stress that the global minima of chaos can be most efficiently reached by tensorization where all nonlinear terms, up to some polynomial order, are treated explicitly and weighted equally. The efficiency and generality of our architecture are systematically tested and confirmed by theoretical analysis and experimental results. In our design, we have explicitly used two different many-body entanglement structures, matrix product states (MPS) and the multiscale entanglement renormalization ansatz (MERA), as physics-inspired tensor decomposition techniques. We find that MERA generally performs better than MPS, and hence conjecture that the learnability of chaos is determined not only by the number of free parameters but also by the tensor complexity, recognized as how entanglement entropy scales with varying matricization of the tensor.
Numerical Analysis, Disordered Systems and Neural Networks, Machine Learning, Chaotic Dynamics, Quantum Physics
What problem does this paper attempt to address?
This paper addresses the difficulty of forecasting chaotic time series, especially when the time step (\(\Delta t\)) between consecutive observations is large and traditional statistical and machine-learning methods struggle to capture the chaotic behavior of nonlinear dynamical systems. Specifically:

1. **Difficulty of chaotic prediction**: Because chaotic systems depend sensitively on initial conditions, any small error is amplified exponentially over time, so the accuracy of multi-step prediction declines sharply.
2. **Local minima in training**: When \(\Delta t\) is large, neural network training tends to settle into a trivial, ergodic local minimum rather than the global optimum.
3. **Limitations of existing methods**: Although a large body of research applies recurrent neural networks (RNNs) to chaotic time series, these methods still fall short in capturing chaotic features, especially over long time intervals.

To solve these problems, the authors propose a new LSTM-based architecture that uses tensorization to strengthen the learning of short-term nonlinear complexity while keeping the long-term memory of the LSTM. The specific improvements are:

- **Tensorized state propagation**: The cell-state-to-state propagation is tensorized, so that all nonlinear terms, up to some polynomial order, are treated explicitly and weighted equally.
- **Quantum entanglement structures**: Two tensor-decomposition techniques from quantum physics, matrix product states (MPS) and the multiscale entanglement renormalization ansatz (MERA), are used to reduce the degrees of freedom of the tensor and improve the expressive power of the model.
- **Experimental verification**: Theoretical analysis and experimental results support the effectiveness and generality of the new architecture, which outperforms traditional methods on typical chaotic systems such as the Lorenz system and the logistic map.

In conclusion, this paper improves the LSTM architecture by introducing tensorization and quantum entanglement structures, so that it captures the complex dynamics of chaotic time series more effectively and overcomes the limitations of existing methods at large time intervals.
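
As a rough illustration of what tensorizing a state map means, the following NumPy sketch contracts a weight tensor with several copies of a state vector that has been augmented with a constant 1, so the result contains every polynomial term up to the chosen order. It is a minimal sketch and not the authors' implementation: the function names, the placement of the output index in a separate matrix `W_out`, and the tensor shapes are all assumptions made for the example, and only the MPS factorization is shown (MERA is omitted).

```python
import numpy as np


def augment(q):
    """Prepend a constant 1 so products of the entries span all lower orders."""
    return np.concatenate(([1.0], np.asarray(q, dtype=float)))


def dense_tensorized_map(W, q, order):
    """Contract W, of shape (out, P, ..., P) with `order` trailing axes,
    against `order` copies of v = [1, q], where P = len(q) + 1."""
    v = augment(q)
    out = W
    for _ in range(order):
        out = np.tensordot(out, v, axes=1)  # contracts the last axis with v
    return out


def mps_tensorized_map(cores, W_out, q):
    """Same kind of map with the weight tensor factorized as an MPS.
    cores[k] has shape (D_left, P, D_right); the first D_left is 1.
    W_out has shape (D_last, out) and carries the output index
    (a hypothetical layout chosen for this sketch)."""
    v = augment(q)
    m = np.ones(1)
    for core in cores:
        # Contract the physical axis with v, leaving a (D_left, D_right) matrix.
        m = m @ np.tensordot(core, v, axes=([1], [0]))
    return m @ W_out


# Order-2 example with a scalar state q = 2:
# W[0] = [[1, 2], [3, 4]] encodes the polynomial 1 + (2 + 3) q + 4 q^2.
W = np.array([[[1.0, 2.0], [3.0, 4.0]]])
y = dense_tensorized_map(W, [2.0], order=2)

# A random order-2 MPS and the dense tensor it represents.
rng = np.random.default_rng(0)
P, D, n_out = 4, 3, 2
cores = [rng.normal(size=(1, P, D)), rng.normal(size=(D, P, D))]
W_out = rng.normal(size=(D, n_out))
W_full = np.einsum("ia,ajb,bo->oij", cores[0][0], cores[1], W_out)
q = rng.normal(size=P - 1)
y_dense = dense_tensorized_map(W_full, q, order=2)
y_mps = mps_tensorized_map(cores, W_out, q)
```

Here `y_dense` and `y_mps` agree up to floating-point error, because the MPS cores are just a factorized form of `W_full`. The dense tensor has `out * P**order` entries, while the MPS parameter count grows only linearly with the order for a fixed bond dimension `D`, which is the sense in which the decomposition reduces the degrees of freedom.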