Abstract:Panel data involving longitudinal measurements of the same set of participants taken over multiple time points is common in studies to understand childhood development and disease modeling. Deep hybrid models that marry the predictive power of neural networks with physical simulators such as differential equations, are starting to drive advances in such applications. The task of modeling not just the observations but the hidden dynamics that are captured by the measurements poses interesting statistical/computational questions. We propose a probabilistic model called ME-NODE to incorporate (fixed + random) mixed effects for analyzing such panel data. We show that our model can be derived using smooth approximations of SDEs provided by the Wong-Zakai theorem. We then derive Evidence Based Lower Bounds for ME-NODE, and develop (efficient) training algorithms using MC based sampling methods and numerical ODE solvers. We demonstrate ME-NODE's utility on tasks spanning the spectrum from simulations and toy data to real longitudinal 3D imaging data from an Alzheimer's disease (AD) study, and study its performance in terms of accuracy of reconstruction for interpolation, uncertainty estimates and personalized prediction.

What problem does this paper attempt to address?

### Problems the paper attempts to solve The paper "Mixed - effect Neural ODE: A Variational Approximation Method for Analyzing Panel Data Dynamics" aims to solve several key problems in panel data analysis: 1. **Modeling of longitudinal data**: - Panel data usually involves repeated measurements of the same group of participants at multiple time points. This type of data is very common in research on child development and disease modeling. - Although existing deep mixture models combine the predictive power of neural networks and physical simulators (such as differential equations), they still face statistical and computational challenges in modeling hidden dynamics. 2. **Introduction of mixed effects**: - In order to better capture the variability between individuals and the variability within individuals, the paper proposes a probabilistic model - Mixed - effect Neural ODE (ME - NODE). - By introducing fixed effects and random effects, this model can more accurately describe the dynamic changes in panel data. 3. **Variational approximation and efficient training**: - The paper derives the evidence lower bound (ELBO) of ME - NODE by the method of smooth approximation of SDE, and develops an efficient training algorithm based on Monte Carlo sampling and numerical ODE solvers. - These methods enable the model to perform effective parameter estimation in high - dimensional situations without the need for back - propagation through SDE solvers. 4. **Verification in practical applications**: - Through a series of experiments, including simulated data, toy data, and real - life longitudinal 3D imaging data from Alzheimer's disease research, the paper verifies the performance of ME - NODE in interpolation, uncertainty estimation, and personalized prediction. ### Formula presentation - **Standard form of SDE**: \[ dz_t = f_\mu(z, t)dt + L\Sigma(z, t)\circ d\beta(t) \] where \( z_t\in\mathbb{R} \), \( f \) and \( L \) represent the drift function and the noise sensitivity function respectively, and \( \mu \) and \( \Sigma \) are unknown parameters. - **Simplified form of ME - NODE**: \[ \dot{z}_t = f(z, t)+g(z, t)b \] where \( b\sim\mathcal{N}(0, 1) \). - **Mixed - effect model**: \[ \phi_i=\nu(X_i\beta + U_i b_i)+\epsilon_i \] where \( X_i\in\mathbb{R}^{n\times m} \) is the covariate matrix, \( \beta\in\mathbb{R}^m \) is the fixed effect, \( b_i\sim\mathcal{N}(0, \Sigma_b) \) is the random effect, \( U_i \) is the design matrix of the random effect, \( \phi_i\in\mathbb{R}^n \) is the response variable, and \( \epsilon_i\sim\mathcal{N}(0, \Sigma_{\epsilon_i}) \) is the noise term. - **ELBO of variational auto - encoder (VAE)**: \[ \log p(x)\geq - \text{KL}(q(z)\|p(z))+\mathbb{E}_{q(z)}[\log p(x|z)] \] where \( z \) is a latent space sample sampled from the approximate posterior distribution \( q(z) \), \( p(z) \) is the prior distribution, \( x \) is the reconstruction of the observed sample, and \( p(x|z) \) is the likelihood of the reconstruction. ### Summary By introducing the Mixed - effect Neural ODE model, the paper solves the statistical and computational problems in hidden dynamic modeling in panel data. This model can not only capture the variability between individuals and the variability within individuals, but also, through variational approximation and efficient training algorithms, achieve in high - dimensional situations.

Mixed Effects Neural ODE: A Variational Approximation for Analyzing the Dynamics of Panel Data

Beyond Predictions in Neural ODEs: Identification and Interventions.

Understanding Latent Timescales in Neural Ordinary Differential Equation Models for Advection-Dominated Dynamical Systems

BrainODE: Dynamic Brain Signal Analysis via Graph-Aided Neural Ordinary Differential Equations

Bayesian Neural Ordinary Differential Equations

Physics-Enhanced Neural Ordinary Differential Equations: Application to Industrial Chemical Reaction Systems

Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations

A statistical approach to latent dynamic modeling with differential equations

Latent Space Energy-based Neural ODEs

Neural Ordinary Differential Equations for Intervention Modeling

ODE Discovery for Longitudinal Heterogeneous Treatment Effects Inference

Parameter-varying neural ordinary differential equations with partition-of-unity networks

Anamnesic Neural Differential Equations with Orthogonal Polynomial Projections

Modulated Neural ODEs

An amortized approach to non-linear mixed-effects modeling based on neural posterior estimation

Neural Mixed Effects for Nonlinear Personalized Predictions

Neural modal ordinary differential equations: Integrating physics-based modeling with neural ordinary differential equations for modeling high-dimensional monitored structures

Efficient Amortised Bayesian Inference for Hierarchical and Nonlinear Dynamical Systems

Multilevel Monte Carlo for a class of Partially Observed Processes in Neuroscience

Dynamical Modeling for Non-Gaussian Data with High-Dimensional Sparse Ordinary Differential Equations

Neural Ordinary Differential Equation based Sequential Image Registration for Dynamic Characterization