When and How: Learning Identifiable Latent States for Nonstationary Time Series Forecasting

Zijian Li,Ruichu Cai,Zhenhui Yang,Haiqin Huang,Guangyi Chen,Yifan Shen,Zhengming Chen,Xiangchen Song,Kun Zhang
2024-06-07
Abstract:Temporal distribution shifts are ubiquitous in time series data. One of the most popular methods assumes that the temporal distribution shift occurs uniformly to disentangle the stationary and nonstationary dependencies. But this assumption is difficult to meet, as we do not know when the distribution shifts occur. To solve this problem, we propose to learn IDentifiable latEnt stAtes (IDEA) to detect when the distribution shifts occur. Beyond that, we further disentangle the stationary and nonstationary latent states via sufficient observation assumption to learn how the latent states change. Specifically, we formalize the causal process with environment-irrelated stationary and environment-related nonstationary variables. Under mild conditions, we show that latent environments and stationary/nonstationary variables are identifiable. Based on these theories, we devise the IDEA model, which incorporates an autoregressive hidden Markov model to estimate latent environments and modular prior networks to identify latent states. The IDEA model outperforms several latest nonstationary forecasting methods on various benchmark datasets, highlighting its advantages in real-world scenarios.
Machine Learning
What problem does this paper attempt to address?
The paper primarily addresses the issue of non-stationary time series forecasting and proposes a new method to overcome the limitations present in existing techniques. Specifically, the paper attempts to solve the following key problems: 1. **Handling Changes in Time Distribution**: Non-stationarity, or changes in time distribution, is common in time series data, posing a challenge to the generalization ability of predictive models over time. 2. **Limitations of Existing Methods**: - Assuming uniform distribution changes or instance-based partitioning, which is often too idealistic in practical applications. - Existing methods struggle to accurately separate stable and non-stable dependencies within the time series. 3. **Proposed New Method**: - **Detecting Time Points of Distribution Changes**: First, determine when distribution changes occur. - **Separating Stable and Non-Stable Latent States**: Further distinguish how these latent states change over time. The paper theoretically proves that the latent environment and stable/non-stable variables can be identified and designs a model named IDEA based on these theories. This model combines an autoregressive hidden Markov model to estimate the latent environment and uses a modular prior network architecture to identify stable and non-stable latent variables. ### Summary - **Target Problem**: Address the issue of non-stationary time series forecasting, particularly how to effectively handle non-stationarity in time series data. - **Solution**: Propose the IDEA model, which can detect when changes in time distribution occur and separate stable and non-stable latent states. - **Theoretical Contribution**: Prove the identifiability of the latent environment and stable/non-stable variables. - **Experimental Validation**: Validate the effectiveness and superiority of the IDEA model through experimental results on synthetic and real-world datasets. In summary, this paper aims to improve the performance of non-stationary time series forecasting tasks by proposing a new method, with a particular focus on effectively distinguishing and utilizing the stable and non-stable characteristics in time series data.