Cell Maps Representation For Lung Adenocarcinoma Growth Patterns Classification In Whole Slide Images

Arwa Al-Rubaian,Gozde N. Gunesli,Wajd A. Althakfi,Ayesha Azam,Nasir Rajpoot,Shan E Ahmed Raza
2024-05-16
Abstract:Lung adenocarcinoma is a morphologically heterogeneous disease, characterized by five primary histologic growth patterns. The quantity of these patterns can be related to tumor behavior and has a significant impact on patient prognosis. In this work, we propose a novel machine learning pipeline capable of classifying tissue tiles into one of the five patterns or as non-tumor, with an Area Under the Receiver Operating Characteristic Curve (AUCROC) score of 0.97. Our model's strength lies in its comprehensive consideration of cellular spatial patterns, where it first generates cell maps from Hematoxylin and Eosin (H&E) whole slide images (WSIs), which are then fed into a convolutional neural network classification model. Exploiting these cell maps provides the model with robust generalizability to new data, achieving approximately 30% higher accuracy on unseen test-sets compared to current state of the art approaches. The insights derived from our model can be used to predict prognosis, enhancing patient outcomes.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the automatic classification of the growth patterns of lung adenocarcinoma (LUAD). Specifically, the paper focuses on how to accurately identify and classify the five main histological growth patterns of LUAD from whole slide images (WSIs) through machine - learning techniques, especially deep - learning methods: lepidic, acinar, papillary, micropapillary and solid. These growth patterns are related to the behavior of tumors and have a significant impact on the prognosis of patients. Therefore, accurate classification of these patterns is of great significance for improving the accuracy and objectivity of pathological diagnosis and for improving the prognosis of patients. The paper points out that traditional visual assessment methods are subjective and inconsistent, especially when dealing with mixed patterns, tumor heterogeneity and time constraints. In addition, most of the existing machine - learning algorithms perform well when evaluating the main pattern of a single WSI, but their performance is insufficient at the tile - level evaluation. This may be because the aggregation method ignores the wrong tiles when generating slide - level predictions. Therefore, the paper proposes a new machine - learning pipeline, aiming to improve the classification of LUAD growth patterns by generating cell maps, thus providing stronger generalization ability, especially when facing unseen data sets. The main contribution of this study lies in the introduction of the concept of cell maps for predicting pathology - specific tasks, which can increase the generalization ability of current machine - learning methods. The experimental results show that the proposed model significantly outperforms the existing state - of - the - art methods on the unseen test set, especially when verified by using the data - splitting method based on WSI.