GradINN: Gradient Informed Neural Network

Filippo Aglietti, Francesco Della Santa, Andrea Piano, Virginia Aglietti
2024-09-03
Abstract: We propose Gradient Informed Neural Networks (GradINNs), a methodology inspired by Physics Informed Neural Networks (PINNs) that can be used to efficiently approximate a wide range of physical systems for which the underlying governing equations are completely unknown or cannot be defined, a condition that is often met in complex engineering problems. GradINNs leverage prior beliefs about a system's gradient to constrain the predicted function's gradient across all input dimensions. This is achieved using two neural networks: one modeling the target function and an auxiliary network expressing prior beliefs, e.g., smoothness. A customized loss function enables training the first network while enforcing gradient constraints derived from the auxiliary network. We demonstrate the advantages of GradINNs, particularly in low-data regimes, on diverse problems spanning non-time-dependent systems (Friedman function, Stokes flow) and time-dependent systems (Lotka-Volterra, Burgers' equation). Experimental results showcase strong performance compared to standard neural networks and PINN-like approaches across all tested scenarios.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses how to efficiently approximate complex engineering systems when the governing equations of the underlying physical system are completely unknown or cannot be defined. Standard neural networks (NNs) perform poorly when data is limited. Physics Informed Neural Networks (PINNs) can improve performance by embedding known physics, but they rely on detailed prior knowledge of the system, which is often unavailable. The paper therefore proposes Gradient Informed Neural Networks (GradINNs), which use prior beliefs about the system's gradient to constrain the gradient of the predicted function across all input dimensions, so that the system can be approximated more accurately from limited data. GradINNs use two neural networks: a primary network that models the target function and an auxiliary network that expresses prior beliefs, such as smoothness. A customized loss function enforces gradient constraints derived from the auxiliary network while the primary network is trained. The method is aimed at low-data regimes and is evaluated on non-time-dependent systems (the Friedman function, Stokes flow) and time-dependent systems (the Lotka-Volterra system, Burgers' equation). In the reported experiments, GradINNs outperform standard neural networks and PINN-like approaches in all tested scenarios.
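To make the two-network idea concrete, here is a minimal sketch of how such a loss could be assembled. The summary above does not give the exact form of the loss, so this assumes a mean squared data-fit term plus a weighted squared penalty between the primary network's input gradient and the auxiliary network's output at a set of collocation points. The names `init_mlp`, `input_gradient`, `gradinn_style_loss`, the `weight` parameter, the network sizes, and the toy data are all hypothetical and chosen only for illustration. It is not the authors' implementation, and it computes the loss value only, without a training loop.

```python
import math
import random

def init_mlp(n_in, n_hidden, n_out, rng):
    """One-hidden-layer tanh MLP with random weights (sizes are illustrative)."""
    return {
        "W1": [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_hidden)],
        "b1": [0.0] * n_hidden,
        "W2": [[rng.uniform(-1, 1) for _ in range(n_hidden)] for _ in range(n_out)],
        "b2": [0.0] * n_out,
    }

def hidden(params, x):
    """Hidden activations tanh(W1 x + b1)."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(params["W1"], params["b1"])]

def forward(params, x):
    """Network output W2 tanh(W1 x + b1) + b2, returned as a list."""
    h = hidden(params, x)
    return [sum(w * hj for w, hj in zip(row, h)) + b
            for row, b in zip(params["W2"], params["b2"])]

def input_gradient(params, x):
    """Analytic gradient of a scalar-output network with respect to its input x."""
    h = hidden(params, x)
    w2 = params["W2"][0]
    return [sum(w2[j] * (1.0 - h[j] ** 2) * params["W1"][j][i] for j in range(len(h)))
            for i in range(len(x))]

def gradinn_style_loss(primary, auxiliary, data, collocation, weight=1.0):
    """ASSUMED loss form: data MSE + weight * mean ||grad primary(x) - auxiliary(x)||^2."""
    data_term = sum((forward(primary, x)[0] - y) ** 2 for x, y in data) / len(data)
    grad_term = 0.0
    for x in collocation:
        g = input_gradient(primary, x)   # gradient of the predicted function
        b = forward(auxiliary, x)        # auxiliary network's gradient belief (assumption)
        grad_term += sum((gi - bi) ** 2 for gi, bi in zip(g, b))
    grad_term /= len(collocation)
    return data_term + weight * grad_term, data_term, grad_term

# Toy usage: 2 input dimensions, 2 labeled points, 16 unlabeled collocation points.
rng = random.Random(0)
primary = init_mlp(2, 8, 1, rng)      # models the target function
auxiliary = init_mlp(2, 8, 2, rng)    # one output per input dimension
data = [([0.1, 0.2], 0.5), ([0.4, 0.9], 1.2)]
collocation = [[rng.random(), rng.random()] for _ in range(16)]
total, data_term, grad_term = gradinn_style_loss(
    primary, auxiliary, data, collocation, weight=0.5)
```

In this sketch the gradient penalty is evaluated on collocation points that need no labels, which is one plausible reason a constraint of this kind could help when labeled data is scarce. How the auxiliary network itself is parameterized or trained is not stated in the summary and is left open here.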