A study on deep feature extraction to detect and classify Acute Lymphoblastic Leukemia (ALL)

Sabit Ahamed Preanto, Md. Taimur Ahad, Yousuf Rayhan Emon, Sumaya Mustofa, Md Alamin
2024-09-11
Abstract: Acute lymphoblastic leukaemia (ALL) is a blood malignancy that affects both children and adults. This study examines the use of deep learning, specifically Convolutional Neural Networks (CNNs), for the detection and classification of ALL. Conventional techniques for ALL diagnosis, such as bone marrow biopsy, are costly and prone to human error. By utilising automated technologies, the research seeks to improve diagnostic accuracy. The research uses a variety of pre-trained CNN models, such as InceptionV3, ResNet101, VGG19, DenseNet121, and MobileNetV2, to extract features from blood smear images. After feature extraction, selection approaches including ANOVA, Recursive Feature Elimination (RFE), Random Forest, Lasso, and Principal Component Analysis (PCA) are used to find the most relevant features. Machine learning methods such as Naïve Bayes, Random Forest, Support Vector Machine (SVM), and K-Nearest Neighbours (KNN) are then used to classify these features. With an 87% accuracy rate, the ResNet101 model produced the best results, closely followed by DenseNet121 and VGG19. According to the study, CNN-based models have the potential to reduce reliance on medical specialists by increasing the speed and accuracy of ALL diagnosis. To improve model performance, the study also recommends expanding and diversifying datasets and investigating more sophisticated architectures such as transformers. The study highlights the potential of automated deep learning systems for medical diagnosis.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the detection and classification of acute lymphoblastic leukemia (ALL) using deep learning, in particular convolutional neural networks (CNNs). Traditional ALL diagnosis methods, such as bone marrow biopsy, are costly and prone to misdiagnosis due to human error. The research therefore aims to use automation to improve diagnostic accuracy, reduce dependence on medical experts, and speed up diagnosis. A variety of pre-trained CNN models (such as InceptionV3, ResNet101, VGG19, DenseNet121, and MobileNetV2) were used to extract features from blood smear images, and the most relevant features were selected by methods such as ANOVA, recursive feature elimination (RFE), random forest, Lasso, and principal component analysis (PCA). The selected features were then classified with machine learning methods such as naive Bayes, random forest, support vector machine (SVM), and K-nearest neighbor (KNN). The results show that the ResNet101 model performs best, with an accuracy of 87%, followed closely by DenseNet121 and VGG19. This indicates that CNN-based models have considerable potential to improve the speed and accuracy of ALL diagnosis.
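The select-then-classify stage described above can be illustrated with a minimal scikit-learn sketch. It is not the authors' implementation: the feature matrix is synthetic data standing in for CNN embeddings of blood smear images, ANOVA (via `SelectKBest` with `f_classif`) is picked as just one of the selection methods named in the paper, and the function name `evaluate_classifiers`, the feature count of 256, and `k=32` are assumptions chosen for illustration. The accuracies it prints say nothing about the 87% reported in the study.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Assumption: synthetic vectors stand in for features extracted by a
# pre-trained CNN (e.g. a pooled ResNet101 output); no images are used here.
X, y = make_classification(
    n_samples=400, n_features=256, n_informative=20,
    n_redundant=0, random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)


def evaluate_classifiers(k=32):
    """Fit scale -> ANOVA selection -> classifier and return test accuracies."""
    classifiers = {
        "naive_bayes": GaussianNB(),
        "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
        "svm": SVC(kernel="rbf"),
        "knn": KNeighborsClassifier(n_neighbors=5),
    }
    scores = {}
    for name, clf in classifiers.items():
        # Selection lives inside the pipeline, so the ANOVA F-scores are
        # computed from the training split only and do not see test labels.
        pipe = make_pipeline(StandardScaler(), SelectKBest(f_classif, k=k), clf)
        pipe.fit(X_train, y_train)
        scores[name] = pipe.score(X_test, y_test)
    return scores


scores = evaluate_classifiers()
for name, acc in scores.items():
    print(f"{name}: {acc:.3f}")
```

Keeping the selector inside the pipeline is a deliberate choice: selecting features on the full dataset before splitting would leak label information into the test set and inflate the reported accuracy. Swapping `SelectKBest` for `RFE`, `PCA`, or a `SelectFromModel` wrapper around Lasso or a random forest would follow the same pattern.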