Automated quantitative MRI volumetry reports support diagnostic interpretation in dementia: a multi-rater, clinical accuracy study

Hugh G. Pemberton,Olivia Goodkin,Ferran Prados,Ravi K. Das,Sjoerd B. Vos,James Moggridge,William Coath,Elizabeth Gordon,Ryan Barrett,Anne Schmitt,Hefina Whiteley-Jones,Christian Burd,Mike P. Wattjes,Sven Haller,Meike W. Vernooij,Lorna Harper,Nick C. Fox,Ross W. Paterson,Jonathan M. Schott,Sotirios Bisdas,Mark White,Sebastien Ourselin,John S. Thornton,Tarek A. Yousry,M. Jorge Cardoso,Frederik Barkhof,
DOI: https://doi.org/10.1007/s00330-020-07455-8
IF: 7.034
2021-01-15
European Radiology
Abstract:Abstract Objectives We examined whether providing a quantitative report (QReport) of regional brain volumes improves radiologists’ accuracy and confidence in detecting volume loss, and in differentiating Alzheimer’s disease (AD) and frontotemporal dementia (FTD), compared with visual assessment alone. Methods Our forced-choice multi-rater clinical accuracy study used MRI from 16 AD patients, 14 FTD patients, and 15 healthy controls; age range 52–81. Our QReport was presented to raters with regional grey matter volumes plotted as percentiles against data from a normative population ( n = 461). Nine raters with varying radiological experience (3 each: consultants, registrars, ‘non-clinical image analysts’) assessed each case twice (with and without the QReport). Raters were blinded to clinical and demographic information; they classified scans as ‘normal’ or ‘abnormal’ and if ‘abnormal’ as ‘AD’ or ‘FTD’. Results The QReport improved sensitivity for detecting volume loss and AD across all raters combined ( p = 0.015* and p = 0.002*, respectively). Only the consultant group’s accuracy increased significantly when using the QReport ( p = 0.02*) . Overall, raters’ agreement (Cohen’s κ ) with the ‘gold standard’ was not significantly affected by the QReport; only the consultant group improved significantly ( κ s 0.41➔0.55, p = 0.04*). Cronbach’s alpha for interrater agreement improved from 0.886 to 0.925, corresponding to an improvement from ‘good’ to ‘excellent’. Conclusion Our QReport referencing single-subject results to normative data alongside visual assessment improved sensitivity, accuracy, and interrater agreement for detecting volume loss. The QReport was most effective in the consultants, suggesting that experience is needed to fully benefit from the additional information provided by quantitative analyses. Key Points • The use of quantitative report alongside routine visual MRI assessment improves sensitivity and accuracy for detecting volume loss and AD vs visual assessment alone. • Consultant neuroradiologists’ assessment accuracy and agreement (kappa scores) significantly improved with the use of quantitative atrophy reports. • First multi-rater radiological clinical evaluation of visual quantitative MRI atrophy report for use as a diagnostic aid in dementia.
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?