Self-supervised learning of a tailored Convolutional Auto Encoder for histopathological prostate grading

Zahra Tabatabaei,Adrian colomer,Kjersti Engan,Javier Oliver,Valery Naranjo
2023-03-21
Abstract:According to GLOBOCAN 2020, prostate cancer is the second most common cancer in men worldwide and the fourth most prevalent cancer overall. For pathologists, grading prostate cancer is challenging, especially when discriminating between Grade 3 (G3) and Grade 4 (G4). This paper proposes a Self-Supervised Learning (SSL) framework to classify prostate histopathological images when labeled images are scarce. In particular, a tailored Convolutional Auto Encoder (CAE) is trained to reconstruct 128x128x3 patches of prostate cancer Whole Slide Images (WSIs) as a pretext task. The downstream task of the proposed SSL paradigm is the automatic grading of histopathological patches of prostate cancer. The presented framework reports promising results on the validation set, obtaining an overall accuracy of 83% and on the test set, achieving an overall accuracy value of 76% with F1-score of 77% in G4.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the challenges in grading histopathological images of prostate cancer, particularly how to improve the accuracy of automatic classification when labeled data is scarce. Specifically, the study focuses on distinguishing between Grade 3 (G3) and Grade 4 (G4) prostate cancer tissue samples, which have subtle differences but require different clinical treatments (G3 requires medication while G4 usually requires surgery), making accurate differentiation crucial. To tackle this issue, the authors propose a method based on Self-Supervised Learning (SSL), which utilizes a customized Convolutional Auto Encoder (CAE) for pre-training to reconstruct image patches of Whole Slide Images (WSIs) of prostate cancer. The main advantage of this approach is its ability to learn useful information from unlabeled data, which is particularly beneficial in situations where labeled data is scarce. After the pre-training phase, the extracted features are used for downstream tasks—namely, the automatic grading of histopathological image patches of prostate tissue. In this way, the paper aims to develop a new self-supervised learning framework that can accurately perform multi-grade classification of prostate cancer tissue with limited labeled data, with a particular emphasis on improving the classification performance between G3 and G4. Experimental results show that the proposed model achieved an overall accuracy of 83% on the validation set and 76% on the test set, with an F1 score of 77%, and performed particularly well at the G4 level. This indicates that the method can effectively address the key challenges in grading prostate cancer tissue.