Neural Network Surrogate and Projected Gradient Descent for Fast and Reliable Finite Element Model Calibration: a Case Study on an Intervertebral Disc

Matan Atad, Gabriel Gruber, Marx Ribeiro, Luis Fernando Nicolini, Robert Graf, Hendrik Möller, Kati Nispel, Ivan Ezhov, Daniel Rueckert, Jan S. Kirschke
2024-12-09
Abstract: Accurate calibration of finite element (FE) models is essential across various biomechanical applications, including human intervertebral discs (IVDs), to ensure their reliability and use in diagnosing and planning treatments. However, traditional calibration methods are computationally intensive, requiring iterative, derivative-free optimization algorithms that often take days to converge. This study addresses these challenges by introducing a novel, efficient, and effective calibration method that uses a neural network (NN) surrogate, demonstrated on a human L4-L5 IVD FE model as a case study. The NN surrogate predicts simulation outcomes with high accuracy, outperforming other machine learning models, and significantly reduces the computational cost associated with traditional FE simulations. Next, a Projected Gradient Descent (PGD) approach guided by gradients of the NN surrogate is proposed to efficiently calibrate FE models. Our method explicitly enforces feasibility with a projection step, thus maintaining material bounds throughout the optimization process. The proposed method is evaluated against state-of-the-art Genetic Algorithm and inverse model baselines on synthetic and in vitro experimental datasets. Our approach demonstrates superior performance on synthetic data, achieving an MAE of 0.06 compared to the baselines' MAE of 0.18 and 0.54, respectively. On experimental specimens, our method outperforms the baseline in 5 out of 6 cases. While our approach requires initial dataset generation and surrogate training, these steps are performed only once, and the actual calibration takes under three seconds. In contrast, traditional calibration time scales linearly with the number of specimens, taking up to 8 days in the worst case. Such efficiency paves the way for applying more complex FE models, potentially extending beyond IVDs, and enabling accurate patient-specific simulations.
Machine Learning
### What problems does this paper attempt to solve?

This paper addresses the computational cost and reliability problems that arise when calibrating finite element (FE) models, especially in biomechanical applications such as human intervertebral disc (IVD) modeling. Specifically:

1. **High computational cost of traditional calibration methods**:
   - Traditional FE model calibration is computationally intensive and usually relies on iterative, derivative-free optimization algorithms, which often take several days to converge.
   - Because these methods have no explicit derivative information, the optimization process is slow and complex.

2. **Improving calibration efficiency and accuracy**:
   - The paper proposes a calibration method in which a neural network (NN) surrogate model predicts simulation results, significantly reducing the computational cost associated with traditional FE simulations.
   - A projected gradient descent (PGD) method driven by the gradients of the NN surrogate is introduced to calibrate the FE model efficiently. A projection step explicitly keeps the material parameters within the feasible range throughout the optimization process.

3. **Validating the effectiveness of the new method**:
   - The method is evaluated on synthetic and in vitro experimental datasets and compared with genetic algorithm (GA) and inverse model baselines.
   - On synthetic data, the new method reaches a mean absolute error (MAE) of 0.06, while the baselines reach 0.18 and 0.54, respectively. On the experimental specimens, the new method outperforms the baseline in 5 out of 6 cases.

4. **Potential for practical applications**:
   - Once the dataset has been generated and the surrogate trained, calibration itself drops from several days to a few seconds. This paves the way for more complex FE models, may extend to other biomechanical fields, and enables accurate patient-specific simulations.

### Formula presentation

- **Projected gradient descent update formula**:

  \[ X(t + 1)=\Pi_{\text{proj}}\left(X(t)-\eta \nabla_X L\right) \]

  where:
  - \(X(t)\) is the current parameter matrix;
  - \(\eta\) is the learning rate;
  - \(\nabla_X L\) is the gradient of the loss function \(L\) with respect to \(X\);
  - \(\Pi_{\text{proj}}\) is the projection function, which ensures that the parameters are within the predefined constraint range.

- **Specific implementation of the projection function**:

  \[ \Pi_{\text{proj}}(X)=\mathbf{1}_K \cdot \text{ClipBounds}(x)^{\top} \]

  where:
  - \(x = \frac{1}{K} \sum_{k = 1}^K X_k\) is the mean of the \(K\) rows of \(X\);
  - \(\text{ClipBounds}(x)=\min(\max(x, b_{\text{min}}), b_{\text{max}})\), applied element-wise, ensures that the parameters are within the boundaries \(b_{\text{min}}\) and \(b_{\text{max}}\);
  - \(\mathbf{1}_K\) is the all-ones column vector of length \(K\), so the clipped mean is copied back into every row of the projected matrix.

Through these improvements, the paper provides a faster and more reliable FE model calibration method suitable for complex biomechanical simulations.
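The update and projection steps described above can be illustrated with a small, dependency-free Python sketch. It is not the authors' implementation. The trained NN surrogate is replaced by a hypothetical per-row linear model (`toy_loss_and_grad`) whose gradient is written out by hand instead of being obtained by automatic differentiation, and all function names, the learning rate, the step count, and the numbers in the demo are assumptions chosen for illustration. The projection is read here as: average the rows of \(X\), clip the mean to the bounds, and copy the result back into every row.

```python
from typing import Callable, List, Tuple

Vector = List[float]
Matrix = List[Vector]


def clip_bounds(x: Vector, b_min: Vector, b_max: Vector) -> Vector:
    """Element-wise min(max(x, b_min), b_max)."""
    return [min(max(v, lo), hi) for v, lo, hi in zip(x, b_min, b_max)]


def project(X: Matrix, b_min: Vector, b_max: Vector) -> Matrix:
    """Average the K rows, clip the mean to the bounds, repeat it K times."""
    K = len(X)
    mean = [sum(col) / K for col in zip(*X)]
    clipped = clip_bounds(mean, b_min, b_max)
    return [list(clipped) for _ in range(K)]


def toy_loss_and_grad(X: Matrix, W: Matrix, y: Vector) -> Tuple[float, Matrix]:
    """Hypothetical stand-in for the NN surrogate: row k predicts W[k] . X[k].

    Returns the squared error summed over rows and its gradient w.r.t. X.
    """
    loss = 0.0
    grad: Matrix = []
    for x_k, w_k, y_k in zip(X, W, y):
        residual = sum(w * v for w, v in zip(w_k, x_k)) - y_k
        loss += residual * residual
        grad.append([2.0 * residual * w for w in w_k])
    return loss, grad


def pgd(
    X: Matrix,
    loss_and_grad: Callable[[Matrix], Tuple[float, Matrix]],
    b_min: Vector,
    b_max: Vector,
    lr: float = 0.1,
    steps: int = 500,
) -> Matrix:
    """Projected gradient descent: gradient step, then projection."""
    X = project(X, b_min, b_max)  # start from a feasible point
    for _ in range(steps):
        _, grad = loss_and_grad(X)
        stepped = [
            [v - lr * g for v, g in zip(x_k, g_k)]
            for x_k, g_k in zip(X, grad)
        ]
        X = project(stepped, b_min, b_max)
    return X


# Demo on made-up numbers: recover two parameters from three toy "measurements".
W = [[1.0, 0.5], [0.2, 1.0], [0.7, 0.3]]   # hypothetical surrogate weights
true_params = [0.4, 0.8]                    # hypothetical ground truth
y = [sum(w * p for w, p in zip(w_k, true_params)) for w_k in W]
b_min, b_max = [0.0, 0.0], [1.0, 1.0]
X0 = [[0.9, 0.1] for _ in W]
X_cal = pgd(X0, lambda X: toy_loss_and_grad(X, W, y), b_min, b_max)
print(X_cal[0])
```

Because the projection runs after every gradient step, each iterate stays inside the bounds, which is the property the summary attributes to the method. In practice the hand-written gradient would be replaced by the gradient of the trained surrogate with respect to its inputs.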