Renal Cell Carcinoma subtyping: learning from multi-resolution localization

Mohamad Mohamad,Francesco Ponzio,Santa Di Cataldo,Damien Ambrosetti,Xavier Descombes
2024-11-14
Abstract:Renal Cell Carcinoma is typically asymptomatic at the early stages for many patients. This leads to a late diagnosis of the tumor, where the curability likelihood is lower, and makes the mortality rate of Renal Cell Carcinoma high, with respect to its incidence rate. To increase the survival chance, a fast and correct categorization of the tumor subtype is paramount. Nowadays, computerized methods, based on artificial intelligence, represent an interesting opportunity to improve the productivity and the objectivity of the microscopy-based Renal Cell Carcinoma diagnosis. Nonetheless, much of their exploitation is hampered by the paucity of annotated dataset, essential for a proficient training of supervised machine learning technologies. This study sets out to investigate a novel self supervised training strategy for machine learning diagnostic tools, based on the multi-resolution nature of the histological samples. We aim at reducing the need of annotated dataset, without significantly reducing the accuracy of the tool. We demonstrate the classification capability of our tool on a whole slide imaging dataset for Renal Cancer subtyping, and we compare our solution with several state-of-the-art classification counterparts.
Computer Vision and Pattern Recognition,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve several key challenges in the subtype classification of renal cell carcinoma (Renal Cell Carcinoma, RCC): 1. **Difficulty in early diagnosis**: RCC is usually asymptomatic in the early stage, which causes many patients to be diagnosed at an advanced stage, reducing the possibility of cure and increasing the mortality rate. Therefore, rapid and accurate classification of tumor subtypes is crucial for improving survival rates. 2. **Insufficient labeled data**: Existing computer - based methods based on supervised learning require a large amount of labeled data for effective model training. However, in the field of pathology, obtaining sufficient labeled data is very time - consuming and error - prone, because it requires pathologists to conduct detailed visual inspections and regional divisions (Region Of Interest, ROI - cropping procedure) on each whole - slide digital image (Whole Slide Images, WSIs). 3. **Limitations of existing methods**: - Many existing studies only consider two or three main malignant RCC subtypes (such as clear cell renal cell carcinoma, papillary renal cell carcinoma, and chromophobe cell renal cell carcinoma), while ignoring benign tumors such as oncocytoma. - Most of the existing automatic classification frameworks rely on large - scale labeled data sets, which limits their promotion in practical applications. - Although self - supervised learning (Self - Supervised Learning, SSL) has performed well in other fields, its application on histopathological images has not been fully explored. To solve these problems, this study proposes a self - supervised learning strategy based on multi - resolution localization, aiming to reduce the need for a large amount of labeled data while maintaining high classification accuracy. Specifically, the researchers designed a new SSL task to predict the position of high - resolution patches in low - resolution images by learning image features at different magnifications. This method can not only effectively utilize unlabeled data but also achieve performance comparable to fully - supervised methods with limited labeled data. ### Key innovation points - **Multi - resolution learning**: Combine image features at different magnifications to simulate the process of pathologists observing and judging at different magnifications. - **Self - supervised pre - training**: By designing specific pre - training tasks (such as predicting the position of high - resolution patches), the dependence on large - scale labeled data is reduced. - **Robustness and extensibility**: The experimental results show that this method can still maintain good performance when the amount of labeled data is reduced, demonstrating its potential in practical applications. Through these innovations, the researchers hope to provide a more efficient, accurate, and practical method for RCC subtype classification, thereby improving the diagnosis and treatment effects of patients.