A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation

Koushik Biswas, Ridal Pal, Shaswat Patel, Debesh Jha, Meghana Karri, Amit Reza, Gorkem Durak, Alpay Medetalibeyoglu, Matthew Antalek, Yury Velichko, Daniela Ladner, Amir Borhani, Ulas Bagci
2024-08-11
Abstract: Accurately segmenting different organs from medical images is a critical prerequisite for computer-assisted diagnosis and intervention planning. This study proposes a deep learning-based approach for segmenting various organs from CT and MRI scans and classifying diseases. Our study introduces a novel technique integrating momentum within residual blocks for enhanced training dynamics in medical image analysis. We applied our method in two distinct tasks: segmenting liver, lung, and colon data and classifying abdominal pelvic CT and MRI scans. The proposed approach has shown promising results, outperforming state-of-the-art methods on publicly available benchmarking datasets. For instance, in the lung segmentation dataset, our approach yielded significant enhancements over the TransNetR model, including a 5.72% increase in Dice score, a 5.04% improvement in mean Intersection over Union (mIoU), an 8.02% improvement in recall, and a 4.42% improvement in precision. Hence, incorporating momentum led to state-of-the-art performance in both segmentation and classification tasks, representing a significant advancement in the field of medical imaging.
Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning, Image and Video Processing
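The four metrics quoted in the abstract can be illustrated with a minimal sketch for a single binary mask. This is not the authors' evaluation code: the function name `segmentation_metrics`, the `eps` smoothing term, and the toy masks are assumptions made for illustration, and the sketch computes IoU for one mask, whereas mIoU averages IoU over images or classes.

```python
import numpy as np


def segmentation_metrics(pred, target, eps=1e-7):
    """Dice, IoU, recall, and precision for one pair of binary masks.

    Hypothetical helper: `eps` only guards against division by zero
    when both masks are empty.
    """
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)

    tp = np.logical_and(pred, target).sum()    # predicted and truly foreground
    fp = np.logical_and(pred, ~target).sum()   # predicted foreground, truly background
    fn = np.logical_and(~pred, target).sum()   # missed foreground

    return {
        "dice": 2 * tp / (2 * tp + fp + fn + eps),
        "iou": tp / (tp + fp + fn + eps),
        "recall": tp / (tp + fn + eps),
        "precision": tp / (tp + fp + eps),
    }


# Toy masks (illustrative only): 2 true positives, 1 false positive, 1 false negative.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
metrics = segmentation_metrics(pred, target)
```

For these toy masks, Dice is 4/6, IoU is 2/4, and recall and precision are both 2/3, which shows why Dice is always at least as large as IoU on the same prediction.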
What problem does this paper attempt to address?
This paper addresses the limited accuracy of medical image segmentation and classification. Accurately segmenting organs such as the liver, lungs, and colon from CT and MRI scans, and classifying diseases, are crucial prerequisites for computer-aided diagnosis and intervention planning. Existing methods struggle with these tasks under low contrast, noise, and patient-to-patient variation, which results in less-than-ideal segmentation and classification accuracy.

To address these challenges, the paper proposes a momentum-based deep learning technique that introduces a momentum term into the residual blocks to improve training dynamics. The method aims to strengthen the ability of existing models to recognize complex data patterns and thereby produce more accurate predictions in segmentation and classification tasks.

The main contributions of the paper are:

1. **Introduction of the momentum mechanism**: A momentum term is integrated into the residual blocks to improve network training, including convergence speed and stability.
2. **Extensive experimental verification**: Experiments on multiple publicly available benchmark datasets show that the proposed momentum-based method outperforms the compared methods in segmentation and classification tasks.
3. **Significant performance improvement**: For example, on the lung segmentation dataset, compared with the TransNetR model, the method improves the Dice coefficient by 5.72%, mIoU by 5.04%, recall by 8.02%, and precision by 4.42%.

In conclusion, the paper addresses key problems in medical image segmentation and classification by introducing a momentum mechanism into residual blocks, with the goal of providing a more reliable tool for clinical applications.
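The summary above does not give the exact update rule, so the following is only a minimal sketch of one common way to add momentum to a residual block: a velocity `v` accumulates the residual branch output, and the activation is updated by that velocity rather than by the branch output directly. The update form, the coefficient `gamma`, the toy branch `residual_branch`, and all function names are assumptions, not the paper's stated formulation.

```python
import numpy as np


def residual_branch(x, weight):
    """Hypothetical residual function f(x): one linear map followed by tanh."""
    return np.tanh(x @ weight)


def plain_residual_step(x, weight):
    """Standard residual update: x_next = x + f(x)."""
    return x + residual_branch(x, weight)


def momentum_residual_step(x, v, weight, gamma=0.9):
    """Assumed momentum update:
        v_next = gamma * v + (1 - gamma) * f(x)
        x_next = x + v_next
    """
    v_next = gamma * v + (1.0 - gamma) * residual_branch(x, weight)
    x_next = x + v_next
    return x_next, v_next


def momentum_forward(x0, weights, gamma=0.9):
    """Run a stack of momentum residual blocks, starting from zero velocity."""
    x = x0
    v = np.zeros_like(x0)
    for w in weights:
        x, v = momentum_residual_step(x, v, w, gamma)
    return x, v


rng = np.random.default_rng(0)
x0 = rng.normal(size=(4, 8))                                   # batch of 4 feature vectors
weights = [rng.normal(scale=0.1, size=(8, 8)) for _ in range(3)]
x_out, v_out = momentum_forward(x0, weights, gamma=0.9)
```

Under this assumed form, setting `gamma=0` recovers the ordinary residual block, while larger values of `gamma` make each block's update depend more on the accumulated updates of earlier blocks.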