Comparative Analysis of Machine Learning Algorithms for Breast Cancer Classification: SVM Outperforms XGBoost, CNN, RNN, and Others

Prithwish Ghosh, Debashis Chatterjee
DOI: https://doi.org/10.1101/2024.04.22.590658
2024-04-26
Abstract: This study evaluates ten machine learning algorithms for classifying breast cancer cases as malignant or benign based on physical attributes. The algorithms tested are XGBoost, CNN, RNN, AdaBoost, Adaptive Decision Learner, LSTM, GRU, Random Forest, SVM, and Logistic Regression. On the Breast Cancer dataset from the UCI Machine Learning Repository, SVM was the most accurate, achieving 98.2456% accuracy. AdaBoost, Logistic Regression, Neural Networks, and Random Forest showed promise, but none matched SVM’s accuracy. These findings underscore the potential of machine learning, particularly SVMs, in cancer diagnosis and treatment, where analysis of physical attributes can support improved diagnostics and targeted therapies.
Bioinformatics
What problem does this paper attempt to address?
This paper evaluates how well multiple machine learning algorithms classify breast cancer as malignant or benign based on physical features. The study compares ten algorithms: XGBoost, Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), AdaBoost, Adaptive Decision Learner, Long Short-Term Memory network (LSTM), Gated Recurrent Unit (GRU), Random Forest, Support Vector Machine (SVM), and Logistic Regression. On the UCI Machine Learning Breast Cancer dataset, SVM achieved the highest accuracy, 98.2456%. Other algorithms, such as AdaBoost, Logistic Regression, Neural Networks, and Random Forest, also performed well but did not surpass SVM. These results highlight the potential of machine learning, especially SVM, in cancer diagnosis and treatment, where analysis of physical features can improve diagnostic accuracy and support targeted therapy.
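To make the headline accuracy figure concrete, the sketch below shows how a classification accuracy such as 98.2456% arises from a count of correct predictions. The summary above does not state the test-set size or the train/test split, so the numbers here are assumptions for illustration only: a dataset of 569 samples (the size of the UCI Wisconsin Diagnostic Breast Cancer dataset), a hypothetical 20% held-out test set, and a hypothetical two misclassified cases. The helper name `accuracy_percent` is likewise illustrative and not taken from the paper.

```python
from math import ceil


def accuracy_percent(n_correct: int, n_total: int) -> float:
    """Return classification accuracy as a percentage."""
    if n_total <= 0:
        raise ValueError("n_total must be positive")
    if not 0 <= n_correct <= n_total:
        raise ValueError("n_correct must lie between 0 and n_total")
    return 100.0 * n_correct / n_total


# Assumption: 569 samples, the size of the UCI Wisconsin Diagnostic dataset.
N_SAMPLES = 569
# Assumption: a 20% held-out test set; the source does not state the split.
TEST_FRACTION = 0.2

# Round the test-set size up to a whole number of samples.
n_test = ceil(N_SAMPLES * TEST_FRACTION)

# Hypothetical: two test cases misclassified.
n_correct = n_test - 2

computed = accuracy_percent(n_correct, n_test)
print(f"{n_correct}/{n_test} correct = {computed:.4f}%")
```

Under these assumptions the computed value rounds to the reported 98.2456%, which suggests, but does not confirm, that the figure reflects a single held-out split of this size. On a test set this small, each misclassified case shifts accuracy by almost a full percentage point, which is worth keeping in mind when comparing the ten algorithms.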