Classification of Breast Cancer Histopathology Images using a Modified Supervised Contrastive Learning Method

Matina Mahdizadeh Sani,Ali Royat,Mahdieh Soleymani Baghshah
2024-09-24
Abstract:Deep neural networks have reached remarkable achievements in medical image processing tasks, specifically in classifying and detecting various diseases. However, when confronted with limited data, these networks face a critical vulnerability, often succumbing to overfitting by excessively memorizing the limited information available. This work addresses the challenge mentioned above by improving the supervised contrastive learning method leveraging both image-level labels and domain-specific augmentations to enhance model robustness. This approach integrates self-supervised pre-training with a two-stage supervised contrastive learning strategy. In the first stage, we employ a modified supervised contrastive loss that not only focuses on reducing false negatives but also introduces an elimination effect to address false positives. In the second stage, a relaxing mechanism is introduced that refines positive and negative pairs based on similarity, ensuring that only relevant image representations are aligned. We evaluate our method on the BreakHis dataset, which consists of breast cancer histopathology images, and demonstrate an increase in classification accuracy by 1.45% in the image level, compared to the state-of-the-art method. This improvement corresponds to 93.63% absolute accuracy, highlighting the effectiveness of our approach in leveraging properties of data to learn more appropriate representation space.
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address several key issues in the classification of breast cancer histopathological images: 1. **Data Scarcity Issue**: In the medical field, labeled data is limited, posing challenges for supervised learning methods. To overcome this issue, the researchers draw on self-supervised learning techniques and utilize the similarity of labeled data and images in the representation space to improve the model. 2. **Overfitting Issue**: When faced with limited data, deep neural networks are prone to overfitting. This paper improves the supervised contrastive learning method by combining image-level labels and domain-specific data augmentation techniques to enhance the model's robustness. 3. **Generalization Ability Issue**: To improve the model's generalization ability, the researchers employ various data augmentation techniques specific to histopathological datasets and HE staining. ### Main Contributions - Proposed a two-stage supervised contrastive learning strategy. The first stage improves the supervised contrastive loss function by not only reducing false negatives but also introducing a mechanism to eliminate false positives. The second stage introduces a relaxation mechanism to adjust positive and negative sample pairs based on similarity. - Conducted experimental evaluation on the BreakHis dataset, achieving a 1.45% improvement in image-level classification accuracy, reaching an absolute accuracy of 93.63%. - The study demonstrates that the proposed method can effectively utilize data attributes to learn a more suitable representation space. ### Method Overview - Utilized SimCLR technology to pre-train the model and used these pre-trained weights as the initial weights for the representation learning stage. - The representation learning stage includes two supervised contrastive learning stages, ultimately completing the classification task through supervised fine-tuning. - Introduced various data augmentation techniques such as random cropping, color jittering, and Gaussian blur to enhance the model's generalization ability. - In the relaxation stage, adjusted positive and negative sample pairs based on similarity in the representation space to further optimize the model. - The fine-tuning stage includes a primary classification task and an auxiliary task to improve the model's robustness to different HE staining. ### Experimental Results - Achieved superior image-level and patient-level accuracy on the BreakHis dataset compared to existing methods, with improvements of 1.45% and 0.31%, respectively. - Validated the model's generalization ability using the BACH dataset, achieving over 90% accuracy even with limited training.