Abstract: Digital analysis of pathology whole-slide images is fast becoming a game changer in cancer diagnosis and treatment. Specifically, deep learning methods have shown great potential to support pathology analysis, with recent studies identifying molecular traits that were not previously recognized in pathology H&E whole-slide images. In parallel with these developments, it is becoming increasingly evident that tumor heterogeneity is an important determinant of cancer prognosis and susceptibility to treatment, and should therefore play a role in the evolving practices of matching treatment protocols to patients. State-of-the-art diagnostic procedures, however, do not provide automated methods for characterizing or quantifying tumor heterogeneity, certainly not in a spatial context. Further, existing methods for analyzing pathology whole-slide images from bulk measurements require many training samples and complex pipelines. Our work addresses these two challenges. First, we train deep learning models to spatially resolve bulk mRNA and miRNA expression levels on pathology whole-slide images (WSIs). Our models reach up to 0.95 AUC on held-out test sets from two cancer cohorts using a simple training pipeline and a small number of training samples. Using the inferred gene expression levels, we further develop a method to spatially characterize tumor heterogeneity. Specifically, we produce tumor molecular cartographies and heterogeneity maps of WSIs and formulate a heterogeneity index (HTI) that quantifies the level of heterogeneity within these maps. Applying our methods to breast and lung cancer slides, we show a statistically significant link between heterogeneity and survival. Our methods potentially open a new and accessible approach to investigating tumor heterogeneity and other spatial molecular properties and their link to clinical characteristics, including treatment susceptibility and survival.

Transcriptomics-guided Slide Representation Learning in Computational Pathology

Multistain Pretraining for Slide Representation Learning in Pathology

Unsupervised Foundation Model-Agnostic Slide-Level Representation Learning

A self-supervised framework for learning whole slide representations

Slide-based Graph Collaborative Training for Histopathology Whole Slide Image Analysis

Spatial transcriptomics inferred from pathology whole-slide images links tumor heterogeneity to survival in breast and lung cancer

Contrastive Multiple Instance Learning: An Unsupervised Framework for Learning Slide-Level Representations of Whole Slide Histopathology Images without Labels

The Whole Pathological Slide Classification via Weakly Supervised Learning

Multimodal Whole Slide Foundation Model for Pathology

Giga-SSL: Self-Supervised Learning for Gigapixel Images

SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification

Morphological Prototyping for Unsupervised Slide Representation Learning in Computational Pathology

Data-efficient and weakly supervised computational pathology on whole-slide images

SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding

PathAlign: A vision-language model for whole slide images in histopathology

Finding Regions of Interest in Whole Slide Images Using Multiple Instance Learning

Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction

Topological Feature Extraction and Visualization of Whole Slide Images using Graph Neural Networks

Slideflow: deep learning for digital histopathology with real-time whole-slide visualization