Inter-laboratory replicability and sensitivity study of a finite element model to quantify human femur failure load: case of metastases

Marc Gardegaront, Amelie Sas, Denis Brizard, Aurelie Levillain, Francois Bermond, Cyrille B. Confavreux, Jean-Baptiste Pialat, G. Harry van Lenthe, Helene Follet, David Mitton
2024-02-14
Abstract: Metastases increase the risk of fracture when affecting the femur. Consequently, clinicians need to know if the patient's femur can withstand the stress of daily activities. The current tools used in clinics are not sufficiently precise. A new method, the CT-scan-based finite element analysis, gives good predictive results. However, none of the existing models were tested for reproducibility. This is a critical issue to address in order to apply the technique on a large cohort around the world to help evaluate bone metastatic fracture risk in patients.
Numerical Analysis, Computation
What problem does this paper attempt to address?
The paper addresses three main questions:

1. **Reproducibility**: Verify that a finite-element model used to predict the failure load of the human femur gives consistent results across laboratories. Specifically, the study evaluates whether different research teams obtain the same results from the original model when using the same CT-scan data set.
2. **Replicability**: Evaluate the performance of this model on different data sets, to determine whether it remains consistent or differs significantly when the data come from different sources.
3. **Global Sensitivity Analysis**: Study the sensitivity of the model to its input parameters, to determine which parameters have the greatest impact on the model output and thus provide a basis for improving the model.

### Specific Problem Description

- **Background**: Bone metastases increase the risk of femoral fractures, affecting patients' daily lives and treatment outcomes. Existing clinical tools such as the Mirels score are not accurate enough, so more reliable prediction methods are needed. CT-scan-based finite-element analysis (FEA) models have shown good predictive performance, but they have not been verified for reproducibility and replicability.
- **Objective**: This study evaluates the reproducibility, replicability, and global sensitivity of an existing and promising femur failure load prediction model (the Leuven model). The specific objectives are as follows:
  - Verify the reproducibility of this model in different laboratories.
  - Evaluate the replicability of this model on different data sets.
  - Analyze the sensitivity of the model to key input parameters to understand which factors have the greatest impact on the prediction results.

### Method Overview

- **Data Sets**: Two data sets were used: CT scans of 8 femurs with surgical defects from the Leuven team, and CT scans of 16 intact femurs and 6 femurs with surgical defects from the Lyon team.
- **Model Evaluation**: Reproducibility is evaluated by comparing the results of the original model and the replicated model on the same data set; replicability is evaluated by comparing the results of the replicated model on two different data sets.
- **Sensitivity Analysis**: The Morris method is used for global sensitivity analysis to evaluate the sensitivity of the model to parameters such as density calibration coefficients, segmentation, orientation, and femur length.

### Conclusions

- **Reproducibility**: The original model and the replicated model show a high correlation on the Leuven data set (\( r^2 = 0.95 \)), but the replicated model systematically overestimates the failure load (with a deviation of 14%).
- **Replicability**: The replicated model performs worse on the Lyon data set than on the Leuven data set, especially when predicting femurs with surgical defects, with a lower correlation (\( r^2 = 0.03 \)) and a larger error.
- **Sensitivity Analysis**: The model is most sensitive to the density calibration coefficient (with an average impact of 12.5%), followed by factors such as segmentation, orientation, and femur length.

Through these studies, the authors hope to provide a more reliable and trustworthy femur failure load prediction model for future research and clinical applications.
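
The reproducibility comparison summarized above reports both a correlation and a systematic deviation, and the two measure different things. The following is a minimal Python sketch, assuming agreement between the original and replicated models is measured by the squared Pearson correlation and by the mean signed relative deviation. The summary does not state the paper's exact definition of deviation, so that choice is an assumption, and the function name and failure-load values are hypothetical, not data from the study.

```python
import numpy as np


def reproducibility_metrics(original, replicated):
    """Return (r_squared, mean_relative_deviation) for two sets of predictions."""
    original = np.asarray(original, dtype=float)
    replicated = np.asarray(replicated, dtype=float)
    # Squared Pearson correlation between the two sets of predictions.
    r = np.corrcoef(original, replicated)[0, 1]
    # Mean signed relative deviation (assumed definition); positive means the
    # replicated model predicts higher failure loads than the original model.
    deviation = np.mean((replicated - original) / original)
    return r ** 2, deviation


# Hypothetical failure loads in newtons (NOT data from the study).
original_loads = [2500.0, 3100.0, 4200.0, 5000.0, 6100.0, 7300.0, 8000.0, 9400.0]
# A replicated model that scales every prediction by the same factor.
replicated_loads = [1.14 * load for load in original_loads]

r2, dev = reproducibility_metrics(original_loads, replicated_loads)
print(f"r^2 = {r2:.2f}, mean relative deviation = {dev:.0%}")
```

In this toy case the replicated predictions are a uniform rescaling of the original ones, so the correlation is perfect while the deviation is nonzero. This is why a high \( r^2 \) alone does not rule out a systematic overestimation.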
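
The Morris screening mentioned in the method overview can be sketched in a few lines. This is a simplified illustration, not the authors' implementation: it perturbs one unit-scaled parameter at a time, only in the upward direction (the full Morris design also allows downward steps), and ranks parameters by the mean absolute elementary effect. The function `toy_failure_load` and its coefficients are arbitrary stand-ins for the finite-element model and do not reflect the study's results.

```python
import numpy as np


def morris_screening(model, n_params, n_trajectories=20, n_levels=4, seed=0):
    """Simplified Morris screening on the unit hypercube.

    Returns (mu_star, sigma): the mean absolute elementary effect and the
    standard deviation of the elementary effects for each parameter.
    """
    rng = np.random.default_rng(seed)
    delta = n_levels / (2.0 * (n_levels - 1))  # commonly used Morris step size
    # Start only from grid levels that keep x + delta inside [0, 1].
    start_levels = np.arange(n_levels // 2) / (n_levels - 1)
    effects = np.empty((n_trajectories, n_params))
    for t in range(n_trajectories):
        x = rng.choice(start_levels, size=n_params)
        y = model(x)
        # Move one parameter at a time, in random order, along the trajectory.
        for i in rng.permutation(n_params):
            x_next = x.copy()
            x_next[i] += delta
            y_next = model(x_next)
            effects[t, i] = (y_next - y) / delta
            x, y = x_next, y_next
    mu_star = np.abs(effects).mean(axis=0)
    sigma = effects.std(axis=0, ddof=1)
    return mu_star, sigma


def toy_failure_load(x):
    """Hypothetical linear surrogate; the coefficients are arbitrary assumptions."""
    weights = np.array([0.25, 0.10, 0.05, 0.02])
    return 5000.0 * (1.0 + weights @ x)


# Labels follow the parameters listed in the summary; their order here is illustrative.
names = ["density calibration", "segmentation", "orientation", "femur length"]
mu_star, sigma = morris_screening(toy_failure_load, n_params=len(names))
for name, m, s in sorted(zip(names, mu_star, sigma), key=lambda item: -item[1]):
    print(f"{name:>20s}: mu* = {m:8.1f}, sigma = {s:6.2f}")
```

A large mu* marks a parameter with a strong overall influence on the output, while a large sigma indicates nonlinearity or interactions with other parameters. The toy surrogate is linear, so sigma is essentially zero here; a real finite-element model would generally not behave this way.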