Abstract:Hematoxylin and eosin (H&E) stained histologic sections contain invaluable information that remains largely untapped because of its complexity. To this end, AI applications employing deep learning (DL) can facilitate the translation of image data to enable human interpretation and yield novel oncological insights that would have otherwise remained imperceptible. DL-based methodologies are multimodal, capable of integrating imaging with clinicogenomic data to furnish a more holistic perspective and affording more accurate predictions in oncology. Here, we developed an unsupervised DL workflow to analyze 1,799 H&E images of lung cancer (NSCLC n = 951; SCLC n = 50; others n = 798) incorporating comprehensive patient-level clinical data (electronic health records [ConcertAI]) integrated with genomics (WES and RNA-seq [Caris Labs]). There are three steps in our approach: (1) image preprocessing and filtering, yielding > 30 million image patches; (2) utilizing pretrained SimCLR models from 57 public oncology histopathology datasets with ResNet-18 as a backbone structure to extract 512-dimensional-feature vectors for each patch; (3) using three unsupervised clustering methods (kmeans, DBSCAN, Leiden clustering) to cluster patches and selected Leiden clustering. We identified 635 primary imaging clusters using an elbow method and generated an image feature matrix by calculating correlations between each patch and cluster centroids; these were aggregated and mapped back to source slides. In this proof-of-concept, distinct image feature patterns characterized SCLC and NSCLC samples. For SCLC, one of the salient features was the presence of hemorrhage, which may be associated with higher rates of fine-needle aspiration biopsy procedure for SCLC compared with NSCLC which was confirmed in the EHR data (p = 0.032). Derived morphological clusters were correlated with tumor-immune genomic features (Tumor Mutational Burden [TMB], Immunologic Constant of Rejection [ICR], and Miracle scores1) serving as predictors of response to immune-checkpoint inhibitor therapy. By applying linear models, we detected 11, 96 and 249 significantly associated imaging clusters, respectively, highly enriched with immune cells e.g., plasma cells, macrophages, lymphocytes, and supporting an infiltrated and inflamed tumor-immune microenvironment. In summary, a multimodal, unsupervised deep learning workflow combining H&E imaging with clinicogenomic data was developed to identify histologic feature clusters associated with well-established tumor-immune genomic signatures of NSCLC immune infiltration and molecular phenotypes. These studies demonstrate enormous potential to yield histopathological and translational insights in NSCLC and SCLC that can empower clinicians to make better therapeutic response predictions. Citation Format: Si Wu, Yujie Zhao, Hugo Luo, Kevin Kolahi, Thanh Bui, Xu Shi, Aditee Shrotre, Alexander Liede, Xi Zhao, Josue Samayoa, Weilong Zhao. Integrating real-world histopathological and clinicogenomic data from 1799 lung cancer patients by applying unsupervised deep learning [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2024; Part 1 (Regular s); 2024 Apr 5-10; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2024;84(6_Suppl) nr 2310.

Deep learning-based six-type classifier for lung cancer and mimics from histopathological whole slide images: a retrospective study

Deep Learning‐based Classification and Spatial Prognosis Risk Score on Whole‐slide Images of Lung Adenocarcinoma

Abstract 2803: Classification of Lung Cancer Histology Images Using Deep Learning

Deep Learning Facilitates Distinguishing Histologic Subtypes of Pulmonary Neuroendocrine Tumors on Digital Whole-Slide Images

[Pathological diagnosis of lung cancer based on deep transfer learning]

Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks

Deep learning classification of lung cancer histology using CT images

E2EFP-MIL: End-to-end and high-generalizability weakly supervised deep convolutional network for lung cancer classification from whole slide image

Deep learning-based diagnosis of histopathological patterns for invasive non-mucinous lung adenocarcinoma using semantic segmentation

Abstract 2310: Integrating real-world histopathological and clinicogenomic data from 1799 lung cancer patients by applying unsupervised deep learning

Automated identification of malignancy in whole-slide pathological images: identification of eyelid malignant melanoma in gigapixel pathological slides using deep learning

Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning

Deep learning-based classification of breast cancer molecular subtypes from H&E whole-slide images

A whole-slide image (WSI)-based immunohistochemical feature prediction system improves the subtyping of lung cancer

Weakly Supervised Deep Learning for Whole Slide Lung Cancer Image Analysis

Lung cancer subtype classification using histopathological images based on weakly supervised multi-instance learning

Deep Learning for Lung Cancer Diagnosis, Prognosis and Prediction Using Histological and Cytological Images: A Systematic Review

Abstract LB243: Deep learning-based molecular characterization of lung cancers from never smokers using hematoxylin and eosin-stained whole slide images

Computer-aided diagnosis of lung carcinoma using deep learning - a pilot study

Classification of Mouse Lung Metastatic Tumor with Deep Learning

Deep Learning-Based Classification of Hepatocellular Nodular Lesions on Whole-Slide Histopathologic Images