Study Protocol: Development and Retrospective Validation of an Artificial Intelligence System for Diagnostic Assessment of Prostate Biopsies
Nita Mulliqi,Anders Blilie,Xiaoyi Ji,Kelvin Szolnoky,Henrik Olsson,Matteo Titus,Geraldine Martinez Gonzalez,Sol Erika Boman,Masi Valkonen,Einar Gudlaugsson,Svein R Kjosavik,Jose Asenjo,Marcello Gambacorta,Paolo Libretti,Marcin Braun,Radzislaw Kordek,Roman Lowicki,Kristina Hotakainen,Paivi Vare,Bodil Ginnerup Pedersen,Karina Dalsgaard Sorensen,Benedicte Parm Ulhoi,Mattias Rantalainen,Pekka Ruusuvuori,Brett Delahunt,Hemamali Samaratunga,Toyonori Tsuzuki,Emilius A.M. Janssen,Lars Egevad,Kimmo Kartasalo,Martin Eklund
DOI: https://doi.org/10.1101/2024.07.04.24309948
2024-07-07
Abstract:Histopathological evaluation of prostate biopsies using the Gleason scoring system is critical for prostate cancer diagnosis and treatment selection. However, grading variability among pathologists can lead to inconsistent assessments, risking inappropriate treatment. Similar challenges complicate the assessment of other prognostic features like cribriform cancer morphology and perineural invasion. Many pathology departments are also facing an increasingly unsustainable workload due to rising prostate cancer incidence and a decreasing pathologist workforce coinciding with increasing requirements for more complex assessments and reporting.
Digital pathology and artificial intelligence (AI) algorithms for analysing whole slide images (WSI) show promise in improving the accuracy and efficiency of histopathological assessments. Studies have demonstrated AI's capability to diagnose and grade prostate cancer comparably to expert pathologists. However, external validations on diverse data sets have been limited and often show reduced performance. Historically, there have been no well-established guidelines for AI study designs and validation methods. Diagnostic assessments of AI systems often lack pre-registered protocols and rigorous external cohort sampling, essential for reliable evidence of their safety and accuracy.
This study protocol covers the retrospective validation of an AI system for prostate biopsy assessment. The primary objective of the study is to develop a high-performing and robust AI model for diagnosis and Gleason scoring of prostate cancer in core needle biopsies, and at scale evaluate whether it can generalise to fully external data from independent patients, pathology laboratories, and digitalisation platforms. The secondary objectives cover AI performance in estimating cancer extent and in detecting cribriform prostate cancer and perineural invasion. This protocol outlines the steps for data collection, predefined partitioning of data cohorts for AI model training and validation, model development, and predetermined statistical analyses, ensuring systematic development and comprehensive validation of the system. The protocol adheres to TRIPOD+AI, PIECES, CLAIM, and other relevant best practices.
Pathology