Predicting diabetes in adults: identifying important features in unbalanced data over a 5-year cohort study using machine learning algorithm

Maryam Talebi Moghaddam,Yones Jahani,Zahra Arefzadeh,Azizallah Dehghan,Mohsen Khaleghi,Mehdi Sharafi,Ghasem Nikfar
DOI: https://doi.org/10.1186/s12874-024-02341-z
2024-09-28
BMC Medical Research Methodology
Abstract:Imbalanced datasets pose significant challenges in predictive modeling, leading to biased outcomes and reduced model reliability. This study addresses data imbalance in diabetes prediction using machine learning techniques. Utilizing data from the Fasa Adult Cohort Study (FACS) with a 5-year follow-up of 10,000 participants, we developed predictive models for Type 2 diabetes.
health care sciences & services
What problem does this paper attempt to address?