Rule Extraction from Support Vector Machines Using Ensemble Learning Approach: An Application for Diagnosis of Diabetes

Longfei Han, Senlin Luo, Jianmin Yu, Limin Pan, Songjing Chen
DOI: https://doi.org/10.1109/jbhi.2014.2325615
IF: 7.7
2015-01-01
IEEE Journal of Biomedical and Health Informatics
Abstract: Diabetes mellitus is a chronic disease and a worldwide public health challenge. It has been shown that 50-80% of type 2 diabetes mellitus (T2DM) cases are undiagnosed. In this paper, support vector machines (SVMs) are used to screen for diabetes, and an ensemble learning module is added that turns the "black box" of SVM decisions into comprehensible and transparent rules; the module also helps address the class imbalance problem. Results on China Health and Nutrition Survey data show that the proposed ensemble learning method generates rule sets with a weighted average precision of 94.2% and a weighted average recall of 93.9% over all classes. Furthermore, the hybrid system can provide a tool for the diagnosis of diabetes and offers a second opinion for lay users.
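The abstract reports weighted average precision and recall without defining them. The sketch below shows one common reading, assumed here rather than taken from the paper: each class's precision and recall is weighted by that class's share of the true labels. The function name `weighted_precision_recall` and the toy labels are hypothetical and do not come from the China Health and Nutrition Survey data.

```python
from collections import Counter


def weighted_precision_recall(y_true, y_pred):
    """Return (precision, recall), each averaged over classes and
    weighted by class support (number of true instances per class).

    Assumption: this is the support-weighted definition; the paper's
    exact definition is not stated in the abstract.
    """
    support = Counter(y_true)      # true instances per class
    predicted = Counter(y_pred)    # predicted instances per class
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    n = len(y_true)

    precision = 0.0
    recall = 0.0
    for cls, count in support.items():
        weight = count / n
        # Treat precision as 0 for a class that is never predicted.
        p = correct[cls] / predicted[cls] if predicted[cls] else 0.0
        r = correct[cls] / count
        precision += weight * p
        recall += weight * r
    return precision, recall


# Toy, invented labels for an imbalanced two-class screening task.
y_true = ["diabetic", "diabetic", "healthy", "healthy", "healthy", "healthy"]
y_pred = ["diabetic", "healthy", "healthy", "healthy", "healthy", "diabetic"]
wp, wr = weighted_precision_recall(y_true, y_pred)
print(f"weighted precision={wp:.3f}, weighted recall={wr:.3f}")
```

Under this definition the weighted recall equals overall accuracy, so on imbalanced data it is worth reading the per-class values alongside the weighted averages.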