Abstract:Differentiable particle filters are an emerging class of models that combine sequential Monte Carlo techniques with the flexibility of neural networks to perform state space inference. This paper concerns the case where the system may switch between a finite set of state-space models, i.e. regimes. No prior approaches effectively learn both the individual regimes and the switching process simultaneously. In this paper, we propose the neural network based regime learning differentiable particle filter (RLPF) to address this problem. We further design a training procedure for the RLPF and other related algorithms. We demonstrate competitive performance compared to the previous state-of-the-art algorithms on a pair of numerical experiments.

What problem does this paper attempt to address?

This paper aims to solve the problem in the state - space model (SSM) that the system may switch dynamically among a set of candidate models or states. Specifically, existing methods cannot effectively learn each state and its switching process simultaneously. To solve this problem, the author proposes the Regime Learning Particle Filter (RLPF) based on neural networks. RLPF can not only learn each state model but also learn the switching dynamics between these states. ### Main contributions: 1. **Neural network parameterization**: A scheme for parameterizing state - switching systems using neural networks is proposed, which can learn the switching dynamics. 2. **Novel training algorithm**: A new training algorithm is designed. By combining the supervised mean - squared - error loss (MSE) and the unsupervised evidence lower - bound loss (ELBO), the accuracy of the model is improved. ### Problem background: - **State - space model (SSM)**: It describes a system consisting of two parts, one is the unobserved discrete - time Markov process \(\{x_t\}\), and the other is its noisy observation \(\{y_t\}\). - **State switching**: The system can randomly jump among multiple different state - space models (referred to as "states" or "regimes"), which are indicated by \(\{k_t\}\). - **Limitations of existing methods**: Existing methods either assume that the switching dynamics are known or cannot learn each state model and the switching process simultaneously. ### Method overview: 1. **Model re - definition**: - Redefine the state - switching dynamics as: \[ k_0 \sim K_\theta(k_0), \] \[ k_t \geq 1 \sim K_\theta(k_t | r_{t - 1}), \] \[ r_t \geq 0 = R_\theta(k_t', r_{t - 1}), \] where \(r_t\) is a low - dimensional embedding vector used to represent historical information. 2. **Parameterize the switching dynamics**: - Use neural networks to parameterize the switching dynamics. The specific form is: \[ r_t = R_\theta(k_t', r_{t - 1}) = \sigma(\Theta_1 r_{t - 1}) \odot \sigma(\Theta_2 k_t') \odot r_{t - 1}+ \tanh(\Theta_3 k_t') \odot \sigma(\Theta_4 k_t'), \] \[ K_\theta'(k_t' | r_{t - 1}) = |\Theta_5 \tanh(\Theta_6 r_{t - 1})| \cdot k_t', \] \[ K_\theta(k_t' | r_{t - 1}) = \frac{K_\theta'(k_t' | r_{t - 1})}{\sum_{c \in K} K_\theta'(c' | r_{t - 1})}, \] where \(\sigma\) and \(\tanh\) are the sigmoid and hyperbolic tangent activation functions respectively, and \(\odot\) represents the Hadamard product. 3. **Training strategy**: - Combine the supervised mean - squared - error loss (MSE) and the unsupervised evidence lower - bound loss (ELBO) for training: \[ L_\theta^{\text{RLPF}}(\hat{x}_{0:T}, x_{0:T}, y_{0:T}) = L_{\text{ELBO}}(\{x_{0:T}, y_{0:T}\})+\lambda L_{\text{MSE}}(\hat{x}_{0:T})

Regime Learning for Differentiable Particle Filters

Differentiable Bootstrap Particle Filters for Regime-Switching Models

Differentiable Interacting Multiple Model Particle Filtering

Normalising Flow-based Differentiable Particle Filters

How to Train Your Differentiable Filter

Differentiable and Stable Long-Range Tracking of Multiple Posterior Modes

Learning state and proposal dynamics in state-space models using differentiable particle filters and neural networks

Revisiting semi-supervised training objectives for differentiable particle filters

Differentiable Particle Filtering using Optimal Placement Resampling

Differentiable Particle Filters with Smoothly Jittered Resampling

Reinforcement Learning for Continuous-Time Mean-Variance Portfolio Selection in a Regime-Switching Market

Particle Filter Recurrent Neural Networks

A sequential learning algorithm based on adaptive particle filtering for RBF networks

A Neural RDE approach for continuous-time non-Markovian stochastic control problems

Enhanced SMC$^2$: Leveraging Gradient Information from Differentiable Particle Filters Within Langevin Proposals

Predictive State Recurrent Neural Networks

Multimodal Learning of Soft Robot Dynamics using Differentiable Filters

Rao-Blackwellised Particle Filtering for Dynamic Bayesian Networks

Variational learning for switching state-space models

Design of Interacting Particle Systems for Fast Linear Quadratic RL

Generalized Reinforcement Learning: Experience Particles, Action Operator, Reinforcement Field, Memory Association, and Decision Concepts