Assessment and quantification of ovarian reserve on the basis of machine learning models
Ting Ding,Wu Ren,Tian Wang,Yun Han,Wenqing Ma,Man Wang,Fangfang Fu,Yan Li,Shixuan Wang
DOI: https://doi.org/10.3389/fendo.2023.1087429
IF: 6.055
2023-03-16
Frontiers in Endocrinology
Abstract:Background: Early detection of ovarian aging is of huge importance, although no ideal marker or acknowledged evaluation system exists. The purpose of this study was to develop a better prediction model to assess and quantify ovarian reserve using machine learning methods. Methods: This is a multicenter, nationwide population-based study including a total of 1,020 healthy women. For these healthy women, their ovarian reserve was quantified in the form of ovarian age, which was assumed equal to their chronological age, and least absolute shrinkage and selection operator (LASSO) regression was used to select features to construct models. Seven machine learning methods, namely artificial neural network (ANN), support vector machine (SVM), generalized linear model (GLM), K-nearest neighbors regression (KNN), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM) were applied to construct prediction models separately. Pearson's correlation coefficient (PCC), mean absolute error (MAE), and mean squared error (MSE) were used to compare the efficiency and stability of these models. Results: Anti-Müllerian hormone (AMH) and antral follicle count (AFC) were detected to have the highest absolute PCC values of 0.45 and 0.43 with age and held similar age distribution curves. The LightGBM model was thought to be the most suitable model for ovarian age after ranking analysis, combining PCC, MAE, and MSE values. The LightGBM model obtained PCC values of 0.82, 0.56, and 0.70 for the training set, the test set, and the entire dataset, respectively. The LightGBM method still held the lowest MAE and cross-validated MSE values. Further, in two different age groups (20–35 and >35 years), the LightGBM model also obtained the lowest MAE value of 2.88 for women between the ages of 20 and 35 years and the second lowest MAE value of 5.12 for women over the age of 35 years. Conclusion: Machine learning methods combining multi-features were reliable in assessing and quantifying ovarian reserve, and the LightGBM method turned out to be the approach with the best result, especially in the child-bearing age group of 20 to 35 years.
endocrinology & metabolism