Abstract:Differentiable particle filters are an emerging class of models that combine sequential Monte Carlo techniques with the flexibility of neural networks to perform state space inference. This paper concerns the case where the system may switch between a finite set of state-space models, i.e. regimes. No prior approaches effectively learn both the individual regimes and the switching process simultaneously. In this paper, we propose the neural network based regime learning differentiable particle filter (RLPF) to address this problem. We further design a training procedure for the RLPF and other related algorithms. We demonstrate competitive performance compared to the previous state-of-the-art algorithms on a pair of numerical experiments.
What problem does this paper attempt to address?
This paper aims to solve the problem in the state - space model (SSM) that the system may switch dynamically among a set of candidate models or states. Specifically, existing methods cannot effectively learn each state and its switching process simultaneously. To solve this problem, the author proposes the Regime Learning Particle Filter (RLPF) based on neural networks. RLPF can not only learn each state model but also learn the switching dynamics between these states.
### Main contributions:
1. **Neural network parameterization**: A scheme for parameterizing state - switching systems using neural networks is proposed, which can learn the switching dynamics.
2. **Novel training algorithm**: A new training algorithm is designed. By combining the supervised mean - squared - error loss (MSE) and the unsupervised evidence lower - bound loss (ELBO), the accuracy of the model is improved.
### Problem background:
- **State - space model (SSM)**: It describes a system consisting of two parts, one is the unobserved discrete - time Markov process \(\{x_t\}\), and the other is its noisy observation \(\{y_t\}\).
- **State switching**: The system can randomly jump among multiple different state - space models (referred to as "states" or "regimes"), which are indicated by \(\{k_t\}\).
- **Limitations of existing methods**: Existing methods either assume that the switching dynamics are known or cannot learn each state model and the switching process simultaneously.
### Method overview:
1. **Model re - definition**:
- Redefine the state - switching dynamics as:
\[
k_0 \sim K_\theta(k_0),
\]
\[
k_t \geq 1 \sim K_\theta(k_t | r_{t - 1}),
\]
\[
r_t \geq 0 = R_\theta(k_t', r_{t - 1}),
\]
where \(r_t\) is a low - dimensional embedding vector used to represent historical information.
2. **Parameterize the switching dynamics**:
- Use neural networks to parameterize the switching dynamics. The specific form is:
\[
r_t = R_\theta(k_t', r_{t - 1}) = \sigma(\Theta_1 r_{t - 1}) \odot \sigma(\Theta_2 k_t') \odot r_{t - 1}+ \tanh(\Theta_3 k_t') \odot \sigma(\Theta_4 k_t'),
\]
\[
K_\theta'(k_t' | r_{t - 1}) = |\Theta_5 \tanh(\Theta_6 r_{t - 1})| \cdot k_t',
\]
\[
K_\theta(k_t' | r_{t - 1}) = \frac{K_\theta'(k_t' | r_{t - 1})}{\sum_{c \in K} K_\theta'(c' | r_{t - 1})},
\]
where \(\sigma\) and \(\tanh\) are the sigmoid and hyperbolic tangent activation functions respectively, and \(\odot\) represents the Hadamard product.
3. **Training strategy**:
- Combine the supervised mean - squared - error loss (MSE) and the unsupervised evidence lower - bound loss (ELBO) for training:
\[
L_\theta^{\text{RLPF}}(\hat{x}_{0:T}, x_{0:T}, y_{0:T}) = L_{\text{ELBO}}(\{x_{0:T}, y_{0:T}\})+\lambda L_{\text{MSE}}(\hat{x}_{0:T})