Comparative Performance of Five Cognitive Screening Tests in a Large Sample of Seniors

Jurij Dreo, Jan Jug, Tisa Pavlovčič, Ajda Ogrin, Anita Demšar, Barbara Aljaž, Filip Agatić, Uros Marusic
DOI: https://doi.org/10.1159/000540225
2024-07-16
Abstract:
Introduction: Recent introductions of disease-modifying treatments for Alzheimer's disease have re-invigorated the cause of early dementia detection. Cognitive "paper and pencil" tests represent the bedrock of clinical assessment, because they are cheap, easy to perform, and do not require brain imaging or biological testing. Cognitive tests vary greatly in duration, complexity, sociolinguistic biases, probed cognitive domains, and their specificity and sensitivity of detecting cognitive impairment (CI). Consequently, an ecologically valid head-to-head comparison seems essential for evidence-based dementia screening.
Method: We compared five tests: Montreal cognitive assessment (MoCA), Alzheimer's disease assessment scale-cognitive subscale (ADAS), Addenbrooke's cognitive examination (ACE-III), euro-coin handling test (Eurotest), and image identification test (Phototest) on a large sample of seniors (N = 456, 77.9 ± 8 years, 71% females). Their specificity and sensitivity were estimated in a novel way by contrasting each test's outcome to the majority outcome across the remaining tests (comparative specificity and sensitivity calculation [CSSC]). This obviates the need for an a priori gold standard such as a clinically clear-cut sample of dementia/MCI/controls. We posit that the CSSC results in a more ecologically valid estimation of clinical performance while precluding biases resulting from different dementia/MCI diagnostic criteria and the proficiency in detecting these conditions.
Results: There exists a stark trade-off between behavioral test specificity and sensitivity. The test with the highest specificity had the lowest sensitivity, and vice versa. The comparative specificities and sensitivities were, respectively: Phototest (97%, 47%), Eurotest (94%, 55%), ADAS (90%, 68%), ACE-III (72%, 77%), MoCA (55%, 95%).
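The CSSC idea described in the Method can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `cssc`, the boolean encoding (True = the test flags cognitive impairment after applying its cutoff), and the handling of ties (participants on whom the remaining tests split evenly are skipped) are all assumptions made for illustration, since the abstract does not specify them. The toy data are invented and carry no empirical meaning.

```python
from typing import Dict, List, Tuple


def cssc(outcomes: Dict[str, List[bool]]) -> Dict[str, Tuple[float, float]]:
    """Return {test: (specificity, sensitivity)}, scoring each test against
    the majority outcome of the remaining tests.

    outcomes maps a test name to one boolean per participant
    (True = flagged as impaired). Tie handling is an assumption:
    participants with an even split among the other tests are skipped.
    """
    names = list(outcomes)
    n_participants = len(outcomes[names[0]])
    result = {}
    for name in names:
        others = [other for other in names if other != name]
        tp = tn = fp = fn = 0
        for i in range(n_participants):
            votes = sum(outcomes[other][i] for other in others)
            if 2 * votes == len(others):
                continue  # even split among the other tests: no reference label
            reference = 2 * votes > len(others)  # majority says "impaired"
            flagged = outcomes[name][i]
            if reference and flagged:
                tp += 1
            elif reference and not flagged:
                fn += 1
            elif not reference and flagged:
                fp += 1
            else:
                tn += 1
        specificity = tn / (tn + fp) if (tn + fp) else float("nan")
        sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
        result[name] = (specificity, sensitivity)
    return result


# Hypothetical toy data: three tests, four participants.
toy = {
    "A": [True, True, False, False],
    "B": [True, False, False, False],
    "C": [True, True, True, False],
}
print(cssc(toy))
```

With five tests, as in the study, each test is compared against four others, so even splits can occur; how such cases are resolved affects the estimates and would need to be taken from the full paper.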
Conclusion: Assuming a CI prevalence of 10%, the shortest (∼3 min) and the simplest instrument, the Phototest, was shown to have the best overall performance (accuracy 92%, PPV 66%, NPV 94%).
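The step from comparative specificity and sensitivity to accuracy, PPV, and NPV at an assumed prevalence follows from standard Bayes-rule arithmetic. The sketch below is an illustration rather than the authors' code; the function name `predictive_values` is hypothetical, and the inputs are the rounded percentages from the abstract, so the outputs may differ from the reported figures by a point or two.

```python
from typing import Dict


def predictive_values(sensitivity: float, specificity: float,
                      prevalence: float) -> Dict[str, float]:
    """Accuracy, PPV and NPV as population fractions, given a test's
    sensitivity and specificity and an assumed prevalence (all in 0..1)."""
    tp = sensitivity * prevalence              # impaired, flagged
    fn = (1 - sensitivity) * prevalence        # impaired, missed
    tn = specificity * (1 - prevalence)        # unimpaired, cleared
    fp = (1 - specificity) * (1 - prevalence)  # unimpaired, flagged
    return {
        "accuracy": tp + tn,
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# Phototest, using the rounded values from the abstract and 10% prevalence.
phototest = predictive_values(sensitivity=0.47, specificity=0.97, prevalence=0.10)
print({key: round(value, 3) for key, value in phototest.items()})
```

With these rounded inputs the Phototest comes out at roughly 92% accuracy, 64% PPV, and 94% NPV, close to the reported values; the small PPV gap is presumably due to rounding of the published percentages. The same calculation for MoCA (55% specificity, 95% sensitivity) gives about 59% accuracy, which shows why, at low prevalence, specificity dominates overall accuracy.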