Fully Automatic Assessment of Background Parenchymal Enhancement on Breast MRI Using Machine‐Learning Models

Yoonho Nam,Ga Eun Park,Junghwa Kang,Sung Hun Kim
DOI: https://doi.org/10.1002/jmri.27429
IF: 4.4
2020-11-20
Journal of Magnetic Resonance Imaging
Abstract:<section class="article-section__content"><h3 class="article-section__sub-title section1"> Background</h3><p>Automated measurement and classification models with objectivity and reproducibility are required for accurate evaluation of the breast cancer risk of fibroglandular tissue (FGT) and background parenchymal enhancement (BPE).</p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Purpose</h3><p>To develop and evaluate a machine‐learning algorithm for breast FGT segmentation and BPE classification.</p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Study Type</h3><p>Retrospective.</p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Population</h3><p>A total of 794 patients with breast cancer, 594 patients assigned to the development set, and 200 patients to the test set.</p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Field Strength/Sequence</h3><p>3T and 1.5T; T<sub>2</sub>‐weighted, fat‐saturated T<sub>1</sub>‐weighted (T<sub>1</sub>W) with dynamic contrast enhancement (DCE). </p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Assessment</h3><p>Manual segmentation was performed for the whole breast and FGT regions in the contralateral breast. The BPE region was determined by thresholding using the subtraction of the pre‐ and postcontrast T<sub>1</sub>W images and the segmented FGT mask. Two radiologists independently assessed the categories of FGT and BPE. A deep‐learning‐based algorithm was designed to segment and measure the volume of whole breast and FGT and classify the grade of BPE. </p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Statistical Tests</h3><p>Dice similarity coefficients (DSC) and Spearman correlation analysis were used to compare the volumes from the manual and deep‐learning‐based segmentations. Kappa statistics were used for agreement analysis. Comparison of area under the receiver operating characteristic (ROC) curves (AUC) and F1 scores were calculated to evaluate the performance of BPE classification.</p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Results</h3><p>The mean (±SD) DSC for manual and deep‐learning segmentations was 0.85 ± 0.11. The correlation coefficient for FGT volume from manual‐ and deep‐learning‐based segmentations was 0.93. Overall accuracy of manual segmentation and deep‐learning segmentation in BPE classification task was 66% and 67%, respectively. For binary categorization of BPE grade (minimal/mild vs. moderate/marked), overall accuracy increased to 91.5% in manual segmentation and 90.5% in deep‐learning segmentation; the AUC was 0.93 in both methods.</p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Data Conclusion</h3><p>This deep‐learning‐based algorithm can provide reliable segmentation and classification results for BPE.</p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Level of Evidence</h3><p>3</p></section><section class="article-section__content"><h3 class="article-section__sub-title section1"> Technical Efficacy Stage</h3><p>2</p></section>
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?