Quality control of cardiac magnetic resonance imaging segmentation, feature tracking, aortic flow and native T1 analysis using automated batch processing in the UK Biobank Study

Sucharitha Chadalavada, Elisa Rauseo, Ahmed Salih, Hafiz Naderi, Mohammed Khanji, Jose D Vargas, Aaron M Lee, Alborz Amir-Kalili, Lisette Lockhart, Ben Graham, Mihaela Chirvasa, Kenneth Fung, Jose Paiva, Mihir M Sanghvi, Gregory G Slabaugh, Magnus T Jensen, Nay Aung, Steffen E Petersen
DOI: https://doi.org/10.1093/ehjimp/qyae094
2024-09-16
Abstract:

Background: Automated algorithms are regularly used to analyse cardiac magnetic resonance (CMR) images. Validating the reliability of the data output from this method is crucial for enabling widespread adoption. We outline a visual quality control (QC) process for image analysis using automated batch processing. We assess the performance of automated analysis and the reliability of replacing visual checks with a statistical outlier removal approach in UK Biobank CMR scans.

Methods: We included 1,987 CMR scans from the UK Biobank COVID imaging study. We used batch processing software (Circle Cardiovascular Imaging Inc. - CVI42) to automatically extract chamber volumetric data, strain, native T1, and aortic flow data. The automated analysis outputs (∼62,000 videos and 2,000 images) were visually checked by six experienced clinicians using a standardised approach and a custom-built R Shiny app. Inter-observer variability was assessed. Data from scans passing visual QC were compared with data from a statistical outlier removal QC method in a subset of healthy individuals (n = 1,069).

Results: Automated segmentation was highly rated, with over 95% of scans passing visual QC. Overall inter-observer agreement was very good (Gwet’s AC2 0.91; 95% confidence interval [0.84, 0.94]). No difference was observed in healthy individuals between the overall data derived from visual QC and those derived from statistical outlier removal.

Conclusion: Automated image analysis using CVI42 prototypes for UK Biobank CMR scans demonstrated high quality. Larger UK Biobank datasets analysed using these automated algorithms do not require in-depth visual QC. Statistical outlier removal is sufficient as a QC measure, with operator discretion for visual checks based on population or research objectives.
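
As an illustration of what a statistical outlier removal QC step can look like, the sketch below applies Tukey's interquartile-range fences to a single measurement. This is an assumption made for illustration only: the abstract does not state which outlier rule or threshold the authors used, and the function names, the multiplier k = 1.5, and the example volumes are all hypothetical rather than taken from the study.

```python
from statistics import quantiles


def iqr_bounds(values, k=1.5):
    """Return (lower, upper) Tukey fences for a list of numbers.

    k = 1.5 is the conventional Tukey multiplier, assumed here;
    it is not a value reported in the abstract.
    """
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def remove_outliers(values, k=1.5):
    """Keep only the values that fall inside the Tukey fences."""
    lower, upper = iqr_bounds(values, k)
    return [v for v in values if lower <= v <= upper]


# Hypothetical left ventricular end-diastolic volumes in mL
# (made-up numbers for illustration, not UK Biobank data).
lvedv_ml = [140, 150, 145, 155, 160, 148, 152, 420]
kept = remove_outliers(lvedv_ml)
print(kept)
```

In this made-up sample the implausible 420 mL value falls outside the fences and is dropped, while the remaining seven values are retained. In practice such a rule would be applied per measurement (and possibly per sex or age stratum), which is a design choice the abstract leaves open.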