EVALUATING MACHINE LEARNING ALGORITHMS FOR BREAST CANCER DETECTION: A STUDY ON ACCURACY AND PREDICTIVE PERFORMANCE
,Md Al-Imran,Salma Akter,,Md Abu Sufian Mozumder,,Rowsan Jahan Bhuiyan,,Md Al Rafi,,Md Shahriar Mahmud Bhuiyan,,Gourab Nicholas Rodrigues,,Md Nazmul Hossain Mir,,Md Amit Hasan,,Ashim Chandra Das,,Md. Emran Hossen,
DOI: https://doi.org/10.37547/tajet/volume06issue09-04
2024-09-01
Abstract:This study evaluates several machine learning algorithms—Support Vector Machine (SVM), Random Forest, Logistic Regression, Decision Tree (C4.5), and k-Nearest Neighbors (KNN)—for breast cancer detection using the Breast Cancer Wisconsin Diagnostic dataset. We implemented comprehensive pre-processing and model evaluation with Scikit-learn in Python. Our findings show that SVM achieved the highest accuracy, with 99.9% on the training set and 98.50% on the testing set, indicating superior performance in handling high-dimensional data. Random Forest also performed well, with accuracies of 98.5% and 98.20%, respectively. Logistic Regression and Decision Tree models provided reliable predictions when tuned, while KNN was less effective. SVM and Random Forest are recommended for clinical decision support systems due to their high accuracy and robustness.