Deep learning for histopathological subtyping and grading of lung adenocarcinoma
H. Sakanashi,S. Berezowska,K. Tabata,T. Miyazaki,Yuka Kitamura,R. Attanoos,W. Uegami,K. Ishida,L. Brčić,Shinsuke Yamaoka,A. Marchevsky,A. Roden,J. English,A. Bychkov,S. Morimoto,A. Fabro,Kris Lami,T. Nagayasu,Frank Schneider,A. Takano,Keitaro Matsumoto,Yukio Kashima,Brandon T. Larsen,Tomonori Tanaka,Noriaki Ota,Junya Fukuoka,Maxwell L. Smith,M. Ozasa,A. Cavazza,T. Tsuchiya
DOI: https://doi.org/10.1101/2022.11.02.514811
2022-11-03
bioRxiv
Abstract:The histopathological distinction of lung adenocarcinoma (LADC) subtypes is subject to high inter-observer variability, which can compromise the optimal assessment of the patient prognosis. Therefore, this study developed convolutional neural networks (CNNs) capable of distinguishing LADC subtypes and predicting disease-specific survival, according to the LADC tumour grades established recently by the International Association for the Study of Lung Cancer pathology committee. Consensus LADC ground truth histopathological images were obtained from seventeen expert pulmonary pathologists and one pathologist in training. Two deep learning models (AI-1 and AI-2) were trained with EfficientNet b3 architecture to predict eight different LADC classes (lepidic, acinar, papillary, micropapillary, solid, invasive mucinous adenocarcinoma, other carcinoma types, and no carcinoma cells). Furthermore, the trained models were tested on an independent cohort of 133 patients. The models achieved high precision, recall, and F1-scores exceeding 0.90 for most of the LADC classes. Clear stratification of the three LADC grades was reached in predicting the disease-specific survival by the two models. Moreover, the grading prediction of one of the trained models was more accurate than those of 14 out of 15 pulmonary pathologists involved in the study (p=0.0003). Both trained models showed high stability in the segmentation of each pair of predicted grades with low variation in the hazard ratio across 200 bootstrapped samples. These findings indicate that the trained CNNs improve the diagnostic accuracy of the pathologist, standardise LADC subtype recognition, and refine LADC grade assessment. Thus, the trained models are promising tools that may assist in the routine evaluation of LADC subtypes and grades in clinical practice.
Biology,Computer Science,Medicine