Screening ovarian cancer by using risk factors: machine learning assists

Raoof Nopour
DOI: https://doi.org/10.1186/s12938-024-01219-x
2024-02-14
BioMedical Engineering OnLine
Abstract: Ovarian cancer (OC) is a prevalent and aggressive malignancy that poses a significant public health challenge. The lack of preventive strategies for OC increases morbidity, mortality, and other negative consequences. Screening OC through risk prediction could be leveraged as a powerful strategy for preventive purposes that have not received much attention. So, this study aimed to leverage machine learning approaches as predictive assistance solutions to screen high-risk groups of OC and achieve practical preventive purposes.
engineering, biomedical
What problem does this paper attempt to address?
The paper addresses early screening and prevention of ovarian cancer (OC), a common malignant tumor that poses a significant public health challenge. Because effective preventive strategies are lacking, the morbidity, mortality, and other negative consequences of OC remain high. The researchers therefore use machine learning to identify women at high risk of OC so that preventive measures can be targeted.

To this end, they collected data from 1,516 women suspected of having ovarian cancer at six clinical centers in Sari, Iran, and built predictive models with six machine learning algorithms: XG-Boost, Random Forest (RF), J-48 Decision Tree, Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Artificial Neural Network (ANN). Each model predicts whether an individual belongs to the high-risk group. Of the six, XG-Boost performed best, with an Area Under the Receiver Operating Characteristic Curve (AU-ROC) of 0.93 (95% confidence interval [0.91–0.95]). The study also ranked the predictors by importance; family history of cancer, age at menopause, and history of chest X-rays were the most influential.

Finally, to assess generalizability, the researchers tested the XG-Boost model on datasets from two external clinical centers, where it also performed well. The summary concludes that the XG-Boost model can support early screening of ovarian cancer, thereby helping to improve public health and reduce the adverse outcomes associated with the disease.
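The headline metric above, an AU-ROC with a 95% confidence interval, can be illustrated with a short sketch. The summary does not say how the paper derived its interval, so the percentile bootstrap used here is an assumption, as are the function names `auc_roc` and `bootstrap_auc_ci`. The data are synthetic stand-ins, not the study's records, and the scores would in practice come from a fitted classifier such as XG-Boost.

```python
import random


def auc_roc(labels, scores):
    """AU-ROC as the probability that a randomly chosen positive case
    scores higher than a randomly chosen negative case (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(labels, scores, n_boot=200, alpha=0.05, seed=0):
    """Percentile-bootstrap interval for the AU-ROC (assumed method,
    not necessarily the one used in the paper)."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # resample contained a single class; AU-ROC is undefined
        stats.append(auc_roc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


# Synthetic stand-in data (NOT from the study): positives tend to score higher.
rng = random.Random(42)
labels = [1 if rng.random() < 0.3 else 0 for _ in range(200)]
scores = [rng.gauss(0.7 if y == 1 else 0.4, 0.15) for y in labels]

auc = auc_roc(labels, scores)
ci_low, ci_high = bootstrap_auc_ci(labels, scores)
print(f"AU-ROC = {auc:.2f} (95% CI [{ci_low:.2f}, {ci_high:.2f}])")
```

The pairwise definition is quadratic in the number of cases, which is acceptable for a cohort of this size; a rank-based implementation would be preferable for much larger datasets.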