Abstract:Digital whole slide images (WSIs) are generally captured at microscopic resolution and encompass extensive spatial data. Directly feeding these images to deep learning models is computationally intractable due to memory constraints, while downsampling the WSIs risks incurring information loss. Alternatively, splitting the WSIs into smaller patches may result in a loss of important contextual information. In this paper, we propose a novel dual attention approach, consisting of two main components, both inspired by the visual examination process of a pathologist: The first soft attention model processes a low magnification view of the WSI to identify relevant regions of interest, followed by a custom sampling method to extract diverse and spatially distinct image tiles from the selected ROIs. The second component, the hard attention classification model further extracts a sequence of multi-resolution glimpses from each tile for classification. Since hard attention is non-differentiable, we train this component using reinforcement learning to predict the location of the glimpses. This approach allows the model to focus on essential regions instead of processing the entire tile, thereby aligning with a pathologist's way of diagnosis. The two components are trained in an end-to-end fashion using a joint loss function to demonstrate the efficacy of the model. The proposed model was evaluated on two WSI-level classification problems: Human epidermal growth factor receptor 2 scoring on breast cancer histology images and prediction of Intact/Loss status of two Mismatch Repair biomarkers from colorectal cancer histology images. We show that the proposed model achieves performance better than or comparable to the state-of-the-art methods while processing less than 10% of the WSI at the highest magnification and reducing the time required to infer the WSI-level label by more than 75%.

Whole slide semantic segmentation: large scale active learning for digital pathology

Deep Learning-Based Semantic Segmentation of Non-Melanocytic Skin Tumors in Whole-Slide Histopathological Images.

Data-efficient and weakly supervised computational pathology on whole-slide images

Deep learning for multi-class semantic segmentation enables colorectal cancer detection and classification in digital pathology images

Overcoming the limitations of patch-based learning to detect cancer in whole slide images

Finding Regions of Interest in Whole Slide Images Using Multiple Instance Learning

A Generalized Deep Learning Framework for Whole-Slide Image Segmentation and Analysis

Pathology Image Analysis Using Segmentation Deep Learning Algorithms

Developing image analysis pipelines of whole-slide images: Pre- and post-processing

Efficient Quality Control of Whole Slide Pathology Images with Human-in-the-loop Training

End-to-end Learning for Image-based Detection of Molecular Alterations in Digital Pathology

Democratizing Pathological Image Segmentation with Lay Annotators via Molecular-empowered Learning

Dual Attention Model with Reinforcement Learning for Classification of Histology Whole-Slide Images

Gigapixel Whole-Slide Images Classification using Locally Supervised Learning

Deep Interactive Learning-based ovarian cancer segmentation of H&E-stained whole slide images to study morphological patterns of BRCA mutation

Clinical-grade computational pathology using weakly supervised deep learning on whole slide images

Robust whole slide image analysis for cervical cancer screening using deep learning

Multistain Pretraining for Slide Representation Learning in Pathology

Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images

Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction

PathML: A unified framework for whole-slide image analysis with deep learning