Abstract:Classical neural ODEs trained with explicit methods are intrinsically limited by stability, crippling their efficiency and robustness for stiff learning problems that are common in graph learning and scientific machine learning. We present a semi-implicit neural ODE approach that exploits the partitionable structure of the underlying dynamics. Our technique leads to an implicit neural network with significant computational advantages over existing approaches because of enhanced stability and efficient linear solves during time integration. We show that our approach outperforms existing approaches on a variety of applications including graph classification and learning complex dynamical systems. We also demonstrate that our approach can train challenging neural ODEs where both explicit methods and fully implicit methods are intractable.

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to solve the stability and efficiency problems faced by traditional neural ordinary differential equations (Neural ODEs) when dealing with stiff learning problems. Specifically, traditional neural ODEs are trained using explicit methods, and when facing stiff problems, they are limited by stability, resulting in low computational efficiency and difficulty in convergence. These problems are particularly common in graph learning and scientific machine learning. #### Main problems and solutions 1. **Stability problems**: - **Limitations of explicit methods**: Explicit solvers (such as the Runge - Kutta method) are simple and easy to use, but when dealing with stiff systems, they require very small time steps to maintain stability, which makes the computational cost extremely high. - **Limitations of implicit methods**: Although fully implicit methods have better stability, the process of solving nonlinear systems is complex and time - consuming, which may lead to convergence failure or computational bottlenecks. 2. **Computational efficiency problems**: - **Inefficiency of explicit methods**: Due to the need for extremely small time steps, explicit methods require a large number of network evaluations (NFEs) in each training round, resulting in overly long training times. - **High computational cost of implicit methods**: Although implicit methods are stable, they need to solve complex nonlinear equations in each iteration, increasing the computational burden. To solve the above problems, the author proposes a new semi - implicit neural ordinary differential equation (Semi - Implicit Neural ODE, SINODE) method. This method divides the right side of the system into linear and nonlinear parts and adopts an implicit - explicit (IMEX) integration method, thereby improving computational efficiency and stability. #### Specific contributions 1. **Proposed the SINODE model**: Based on a system with linear - nonlinear partitioning, it uses the IMEX method for integration, significantly improving computational efficiency and stability. 2. **Developed a differentiable IMEX ODE solver**: This solver uses the discrete adjoint method, avoiding the cost of back - propagation through the ODE solver, and supports efficient linear solution techniques, which is suitable for small - batch training of SINODE. 3. **Demonstrated the effectiveness and efficiency of SINODE**: Through graph learning tasks and two time - series prediction problems (including stiff and chaotic systems), it verifies the superior performance of SINODE in dealing with complex dynamic systems. In conclusion, this paper solves the stability and efficiency problems of traditional neural ODEs when dealing with stiff problems by introducing the SINODE method, making it possible for a wider range of applications.

Semi-Implicit Neural Ordinary Differential Equations

Stiff neural ordinary differential equations

Semi-Implicit Neural Solver for Time-dependent Partial Differential Equations

Training Stiff Neural Ordinary Differential Equations with Implicit Single-Step Methods

Implicit regularization of deep residual networks towards neural ODEs

Learning Neural Event Functions for Ordinary Differential Equations

Training Stiff Neural Ordinary Differential Equations with Explicit Exponential Integration Methods

Constrained Neural Ordinary Differential Equations with Stability Guarantees

Invariant Neural Ordinary Differential Equations

Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs

Two Sides of The Same Coin: Bridging Deep Equilibrium Models and Neural ODEs via Homotopy Continuation

Efficient, Accurate and Stable Gradients for Neural ODEs

Embedding Capabilities of Neural ODEs

Time Dependence in Non-Autonomous Neural ODEs

Do Residual Neural Networks discretize Neural Ordinary Differential Equations?

Neural Ordinary Differential Equations for Model Order Reduction of Stiff Systems

Bayesian Neural Ordinary Differential Equations

ODEN: A Framework to Solve Ordinary Differential Equations using Artificial Neural Networks

The advance of neural ordinary differential ordinary differential equations

Stabilized Neural Differential Equations for Learning Dynamics with Explicit Constraints

Anamnesic Neural Differential Equations with Orthogonal Polynomial Projections