Exploiting nearest neighbor data and fuzzy membership function to address missing values in classification

Kurnia Muludi,Revita Setianingsih,Ridho Sholehurrohman,Akmal Junaidi
DOI: https://doi.org/10.7717/peerj-cs.1968
2024-03-28
PeerJ Computer Science
Abstract:The accuracy of most classification methods is significantly affected by missing values. Therefore, this study aimed to propose a data imputation method to handle missing values through the application of nearest neighbor data and fuzzy membership function as well as to compare the results with standard methods. A total of five datasets related to classification problems obtained from the UCI Machine Learning Repository were used. The results showed that the proposed method had higher accuracy than standard imputation methods. Moreover, triangular method performed better than Gaussian fuzzy membership function. This showed that the combination of nearest neighbor data and fuzzy membership function was more effective in handling missing values and improving classification accuracy.
computer science, information systems, artificial intelligence, theory & methods
What problem does this paper attempt to address?