Critical evaluation of artificial intelligence as a digital twin of pathologists for prostate cancer pathology

Okyaz Eminaga,Mahmoud Abbas,Christian Kunder,Yuri Tolkach,Ryan Han,James D. Brooks,Rosalie Nolley,Axel Semjonow,Martin Boegemann,Robert West,Jin Long,Richard E. Fan,Olaf Bettendorf
DOI: https://doi.org/10.1038/s41598-024-55228-w
IF: 4.6
2024-03-06
Scientific Reports
Abstract:Prostate cancer pathology plays a crucial role in clinical management but is time-consuming. Artificial intelligence (AI) shows promise in detecting prostate cancer and grading patterns. We tested an AI-based digital twin of a pathologist, vPatho, on 2603 histological images of prostate tissue stained with hematoxylin and eosin. We analyzed various factors influencing tumor grade discordance between the vPatho system and six human pathologists. Our results demonstrated that vPatho achieved comparable performance in prostate cancer detection and tumor volume estimation, as reported in the literature. The concordance levels between vPatho and human pathologists were examined. Notably, moderate to substantial agreement was observed in identifying complementary histological features such as ductal, cribriform, nerve, blood vessel, and lymphocyte infiltration. However, concordance in tumor grading decreased when applied to prostatectomy specimens (κ = 0.44) compared to biopsy cores (κ = 0.70). Adjusting the decision threshold for the secondary Gleason pattern from 5 to 10% improved the concordance level between pathologists and vPatho for tumor grading on prostatectomy specimens (κ from 0.44 to 0.64). Potential causes of grade discordance included the vertical extent of tumors toward the prostate boundary and the proportions of slides with prostate cancer. Gleason pattern 4 was particularly associated with this population. Notably, the grade according to vPatho was not specific to any of the six pathologists involved in routine clinical grading. In conclusion, our study highlights the potential utility of AI in developing a digital twin for a pathologist. This approach can help uncover limitations in AI adoption and the practical application of the current grading system for prostate cancer pathology.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the challenges in prostate cancer pathology assessment, especially how to use artificial intelligence (AI) technology as a digital twin of pathologists to improve the efficiency and accuracy of prostate cancer detection, tumor volume estimation, and tumor grading. Specifically: 1. **Improving the efficiency of pathological assessment**: Traditional manual segmentation and grading of prostate cancer tissues are time - consuming and labor - intensive, especially when dealing with prostatectomy specimens. The paper explores the performance of the AI system (vPatho) in these tasks, aiming to accelerate the clinical workflow through an automated process. 2. **Improving the accuracy of pathological assessment**: The paper analyzes the consistency level between vPatho and six human pathologists in prostate cancer detection, tumor volume estimation, and tumor grading. The study found that vPatho performs well in detecting prostate cancer and estimating tumor volume, but there is a certain degree of inconsistency in tumor grading, especially when dealing with prostatectomy specimens. 3. **Identifying and solving the limitations of the current grading system**: The paper explores the reasons for the grading inconsistency between vPatho and human pathologists, including the degree of vertical extension of the tumor to the prostate boundary, the proportion of slices containing prostate cancer, and the particularity of Gleason pattern 4. The study found that adjusting the decision threshold of the secondary Gleason pattern (from 5% to 10%) can significantly improve the grading consistency between vPatho and pathologists. 4. **Evaluating the generalization ability of the AI system**: The paper also tests the performance of vPatho on different datasets, including slice images obtained from different studies, to verify its generalization ability and the feasibility of practical application. In summary, this paper aims to reveal the advantages and limitations of the AI system vPatho in prostate cancer pathology assessment by evaluating its performance, and to put forward suggestions for improving the existing grading system, so as to promote the wide application of AI technology in clinical practice.