Protocol for evaluating the fitness for purpose of an artificial intelligence product for radiology reporting in the BreastScreen New South Wales breast cancer screening programme

Warner-Smith,M.,Ren,K.,Mistry,C.,Walton,R.,Roder,D.,Bhola,N.,McGill,S.,OBrien,T. A.

DOI: https://doi.org/10.1136/bmjopen-2023-082350

IF: 3.006

2024-05-29

BMJ Open

Abstract:Introduction Radiologist shortages threaten the sustainability of breast cancer screening programmes. Artificial intelligence (AI) products that can interpret mammograms could mitigate this risk. While previous studies have suggested this technology has accuracy comparable to radiologists most have been limited by using 'enriched' datasets and/or not considering the interaction between the algorithm and human readers. This study will address these limitations by comparing the accuracy of a workflow using AI alongside radiologists on a large consecutive cohort of examinations from a breast cancer screening programme. The study will combine the strengths of a large retrospective design with the benefit of prospective data collection. It will test this technology without risk to screening programme participants nor the need to wait for follow-up data. With a sample of 2 years of consecutive screening examinations, it is likely the largest test of this technology to date. The study will help determine whether this technology can safely be introduced into the BreastScreen New South Wales (NSW) population-based screening programme to address radiology workforce risks without compromising cancer detection rates or increasing false-positive recalls. Methods and analysis A retrospective, consecutive cohort of digital mammography screens from 658 207 examinations from BreastScreen NSW will be reinterpreted by the Lunit Insight MMG AI product. The cohort includes 4383 screen-detected and 1171 interval cancers. The results will be compared with radiologist single reading and the AI results will also be used to replace the second reader in a double-reading model. New adjudication reading will be performed where the AI disagrees with the first reader. Recall rates and cancer detection rates of combined AI–radiologist reading will be compared with the rates obtained at the time of screening. Ethics and dissemination This study has ethical approval from the NSW Health Population Health Services Research Ethics Committee (2022/ETH02397). Findings will be published in peer-reviewed journals and presented at conferences. The findings of this evaluation will be provided to programme managers, governance bodies and other stakeholders in Australian breast cancer screening programmes.

medicine, general & internal

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is to evaluate the applicability and effectiveness of artificial intelligence products in breast cancer screening programs, especially for the interpretation of radiology reports. Specifically, the research aims to: 1. **Evaluate the accuracy of AI products**: By comparing with the average accuracy of a single radiologist, use a series of indicators such as sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) to measure the performance of AI products. 2. **Compare the accuracy of the standard process and the AI - assisted process**: The research will compare the standard double - blind reading process (including independent arbitration reading) of the BreastScreen NSW (BSNSW) in New South Wales with the process of using AI to replace one of the initial readers, and use routine screening indicators such as radiology reading workload, the number of examinations requiring diagnostic evaluation, cancer detection rate and interval cancer rate for evaluation. 3. **Determine the impact of introducing AI on the screening program**: The research will explore whether AI technology can be safely introduced into the population - based screening program of BSNSW to address the shortage of radiologists without reducing the cancer detection rate or increasing the false - positive recall rate. Through these objectives, the research aims to provide a scientific basis for the practical application of AI technology in breast cancer screening, especially its performance on large - scale real - world data sets, thereby providing reliable information for decision - makers regarding the application of AI technology in clinical practice.

Protocol for evaluating the fitness for purpose of an artificial intelligence product for radiology reporting in the BreastScreen New South Wales breast cancer screening programme

Clinical use of the tetrazolium test for evaluation of potential cadaveric donors for renal homotransplantation.

AI integration improves breast cancer screening in a real-world, retrospective cohort study

Use of artificial intelligence for image analysis in breast cancer screening programmes: systematic review of test accuracy

Impact of Different Mammography Systems on Artificial Intelligence Performance in Breast Cancer Screening

Career trajectory nursing: the Randice Approach.

Comparison of AI-integrated pathways with human-AI interaction for population mammographic screening

Comparison of AI-integrated pathways with human-AI interaction in population mammographic screening for breast cancer

International evaluation of an AI system for breast cancer screening

Evaluation of Combined Artificial Intelligence and Radiologist Assessment to Interpret Screening Mammograms

Familiarity, confidence and preference of artificial intelligence feedback and prompts by Australian breast cancer screening readers

Can we reduce the workload of mammographic screening by automatic identification of normal exams with artificial intelligence? A feasibility study

Artificial Intelligence for Reducing Workload in Breast Cancer Screening with Digital Breast Tomosynthesis

Abstract OT3-18-03: The PRAIM study: A prospective multicenter observational study of an integrated Artificial Intelligence system with live monitoring

A prospective study of amylase-rich pleural effusions with special reference to amylase isoenzyme analysis.

Potential Impact of an Artificial Intelligence-based Mammography Triage Algorithm on Performance and Workload in a Population-based Screening Sample

BT09 Multicentre prospective clinical performance analysis of an artificial intelligence as a medical device deployed within urgent suspected skin cancer pathways

Abstract 3449: Is AI-enhanced breast ultrasound ready for breast cancer screening in low-resource environments? A systematic review

Diagnostic capabilities of artificial intelligence as an additional reader in a breast cancer screening program

Improving frequency measurements: switching loads

Early Indicators of the Impact of Using AI in Mammography Screening for Breast Cancer