AI performance by mammographic density in a retrospective cohort study of 99,489 participants in BreastScreen Norway

Marie Burns Bergan,Marthe Larsen,Nataliia Moshina,Hauke Bartsch,Henrik Wethe Koch,Hildegunn Siv Aase,Zhanbolat Satybaldinov,Ingfrid Helene Salvesen Haldorsen,Christoph I. Lee,Solveig Hofvind
DOI: https://doi.org/10.1007/s00330-024-10681-z
IF: 7.034
2024-03-25
European Radiology
Abstract:Abstract Objective To explore the ability of artificial intelligence (AI) to classify breast cancer by mammographic density in an organized screening program. Materials and method We included information about 99,489 examinations from 74,941 women who participated in BreastScreen Norway, 2013–2019. All examinations were analyzed with an AI system that assigned a malignancy risk score (AI score) from 1 (lowest) to 10 (highest) for each examination. Mammographic density was classified into Volpara density grade (VDG), VDG1–4; VDG1 indicated fatty and VDG4 extremely dense breasts. Screen-detected and interval cancers with an AI score of 1–10 were stratified by VDG. Results We found 10,406 (10.5% of the total) examinations to have an AI risk score of 10, of which 6.7% (704/10,406) was breast cancer. The cancers represented 89.7% (617/688) of the screen-detected and 44.6% (87/195) of the interval cancers. 20.3% (20,178/99,489) of the examinations were classified as VDG1 and 6.1% (6047/99,489) as VDG4. For screen-detected cancers, 84.0% (68/81, 95% CI, 74.1–91.2) had an AI score of 10 for VDG1, 88.9% (328/369, 95% CI, 85.2–91.9) for VDG2, 92.5% (185/200, 95% CI, 87.9–95.7) for VDG3, and 94.7% (36/38, 95% CI, 82.3–99.4) for VDG4. For interval cancers, the percentages with an AI score of 10 were 33.3% (3/9, 95% CI, 7.5–70.1) for VDG1 and 48.0% (12/25, 95% CI, 27.8–68.7) for VDG4. Conclusion The tested AI system performed well according to cancer detection across all density categories, especially for extremely dense breasts. The highest proportion of screen-detected cancers with an AI score of 10 was observed for women classified as VDG4. Clinical relevance statement Our study demonstrates that AI can correctly classify the majority of screen-detected and about half of the interval breast cancers, regardless of breast density. Key Points • Mammographic density is important to consider in the evaluation of artificial intelligence in mammographic screening. • Given a threshold representing about 10% of those with the highest malignancy risk score by an AI system, we found an increasing percentage of cancers with increasing mammographic density. • Artificial intelligence risk score and mammographic density combined may help triage examinations to reduce workload for radiologists.
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?
This paper aims to explore the ability of artificial intelligence (AI) systems to classify breast cancer according to breast density in organized breast cancer screening programs. Specifically, the study focuses on how AI systems evaluate the risk of breast cancer based on mammographic density and explores the performance of this evaluation in different breast density categories. ### Research Background - **Breast Cancer Screening**: Most European countries have implemented mammography screening programs to reduce breast cancer mortality. Since 1996, Norway has launched a national breast cancer screening project - BreastScreen Norway, which invites women aged 50 - 69 to undergo bilateral mammography screening every two years. - **The Importance of Breast Density**: Mammographic density refers to the proportion of fibroglandular tissue to adipose tissue in the breast and is an independent risk factor for breast cancer. Women with higher breast density have a 4 - 6 times higher risk of breast cancer than those with lower breast density. In addition, the sensitivity of mammography screening in women with higher breast density is less than 70%, while it is 85% - 90% in women with lower breast density. ### Research Objectives - **Evaluating the Performance of AI Systems**: The study aims to analyze the ability of AI systems to identify breast cancer in different breast density categories. - **Combining Breast Density and AI Scores**: To explore whether the combination of AI scores and breast density can assist in screening examinations and reduce the workload of radiologists. ### Research Methods - **Data Sources**: The study included data from 99,489 digital mammography examinations from two breast centers in Norway (Rogaland and Hordaland) between 2013 and 2019. - **AI Scores**: The commercial AI system Transpara version 1.7.0 was used to score the risk of malignancy for each view of each breast, with scores ranging from 1 to 10, where 10 represents the highest risk. - **Breast Density Classification**: The Volpara software was used to automatically classify breast density into four levels (VDG1 - VDG4), corresponding to fatty, scattered fibroglandular, heterogeneous dense, and extremely dense breasts respectively. ### Main Results - **AI Scores and Cancer Detection**: - 89.7% of screening - detected cancers and 44.6% of interval cancers were assigned an AI score of 10. - For extremely dense breasts (VDG4), 94.7% of screening - detected cancers and 48.0% of interval cancers were assigned an AI score of 10. - **Performance in Different Breast Density Categories**: - VDG1: 84.0% of screening - detected cancers were assigned an AI score of 10. - VDG2: 88.9% of screening - detected cancers were assigned an AI score of 10. - VDG3: 92.5% of screening - detected cancers were assigned an AI score of 10. - VDG4: 94.7% of screening - detected cancers were assigned an AI score of 10. ### Conclusions - **Performance of AI Systems**: AI systems show good cancer - detecting ability in different breast density categories, especially in extremely dense breasts. - **Clinical Significance**: The study shows that AI can correctly classify most screening - detected cancers and about half of the interval cancers, regardless of breast density. ### Clinical Applications - **Auxiliary Screening**: The combination of AI scores and breast density can assist in screening examinations, reduce the workload of radiologists, and improve screening efficiency. - **Supplementary Screening Recommendations**: For women with extremely dense breasts, it is recommended to undergo additional MRI screening or annual mammography screening. Through these research results, researchers hope to provide more tools and strategies for breast cancer screening, especially to improve the accuracy and efficiency of screening in women with higher breast density.