Image Quality Assessment of Fetal Brain MRI Using Multi‐Instance Deep Learning Methods
Axel Largent,Kushal Kapse,Scott D. Barnett,Josepheen De Asis‐Cruz,Matthew Whitehead,Jonathan Murnick,Li Zhao,Nicole Andersen,Jessica Quistorff,Catherine Lopez,Catherine Limperopoulos
DOI: https://doi.org/10.1002/jmri.27649
IF: 4.4
2021-04-23
Journal of Magnetic Resonance Imaging
Abstract:BackgroundDue to random motion of fetuses and maternal respirations, image quality of fetal brain MRIs varies considerably. To address this issue, visual inspection of the images is performed during acquisition phase and after 3D‐reconstruction, and the images are re‐acquired if they are deemed to be of insufficient quality. However, this process is time‐consuming and subjective. Multi‐instance (MI) deep learning methods (DLMs) may perform this task automatically. PurposeTo propose an MI count‐based DLM (MI‐CB‐DLM), an MI vote‐based DLM (MI‐VB‐DLM), and an MI feature‐embedding DLM (MI‐FE‐DLM) for automatic assessment of 3D fetal‐brain MR image quality. To quantify influence of fetal gestational age (GA) on DLM performance. Study typeRetrospective. SubjectsTwo hundred and seventy‐one MR exams from 211 fetuses (mean GA ± SD = 30.9 ± 5.5 weeks). Field Strength/SequenceT2‐weighted single‐shot fast spin‐echo acquired at 1.5 T. AssessmentThe T2‐weighted images were reconstructed in 3D. Then, two fetal neuroradiologists, a clinical neuroscientist, and a fetal MRI technician independently labeled the reconstructed images as 1 or 0 based on image quality (1 = high; 0 = low). These labels were fused and served as ground truth. The proposed DLMs were trained and evaluated using three repeated 10‐fold cross‐validations (training and validation sets of 244 and 27 scans). To quantify GA influence, this variable was included as an input of the DLMs. Statistical TestsDLM performance was evaluated using precision, recall, F‐score, accuracy, and AUC values. ResultsPrecision, recall, F‐score, accuracy, and AUC averaged over the three cross validations were 0.85 ± 0.01, 0.85 ± 0.01, 0.85 ± 0.01, 0.85 ± 0.01, 0.93 ± 0.01, for MI‐CB‐DLM (without GA); 0.75 ± 0.03, 0.75 ± 0.03, 0.75 ± 0.03, 0.75 ± 0.03, 0.81 ± 0.03, for MI‐VB‐DLM (without GA); 0.81 ± 0.01, 0.81 ± 0.01, 0.81 ± 0.01, 0.81 ± 0.01, 0.89 ± 0.01, for MI‐FE‐DLM (without GA); and 0.86 ± 0.01, 0.86 ± 0.01, 0.86 ± 0.01, 0.86 ± 0.01, 0.93 ± 0.01, for MI‐CB‐DLM with GA. Data ConclusionMI‐CB‐DLM performed better than other DLMs. Including GA as an input of MI‐CB‐DLM improved its performance. MI‐CB‐DLM may potentially be used to objectively and rapidly assess fetal MR image quality. Evidence Level4 Technical EfficacyStage 3
radiology, nuclear medicine & medical imaging