External validation of a shortened screening tool using individual participant data meta-analysis: A case study of the Patient Health Questionnaire-Dep-4
Daphna Harel,Brooke Levis,Ying Sun,Felix Fischer,John P A Ioannidis,Pim Cuijpers,Scott B Patten,Roy C Ziegelstein,Sarah Markham,Andrea Benedetti,Brett D Thombs,DEPRESsion Screening Data DEPRESSD PHQ Collaboration,Chen He,Yin Wu,Ankur Krishnan,Parash Mani Bhandari,Dipika Neupane,Zelalem Negeri,Mahrukh Imran,Danielle B Rice,Kira E Riehm,Marleine Azar,Alexander W Levis,Jill Boruff,Simon Gilbody,Lorie A Kloda,Dagmar Amtmann,Liat Ayalon,Hamid R Baradaran,Anna Beraldi,Charles N Bernstein,Arvin Bhana,Ryna Imma Buji,Marcos H Chagas,Juliana C N Chan,Lai Fong Chan,Dixon Chibanda,Aaron Conway,Federico M Daray,Janneke M de Man-van Ginkel,Crisanto Diez-Quevedo,Sally Field,Jane R W Fisher,Daniel Fung,Emily C Garman,Alan J Flisher,Bizu Gelaye,Leila Gholizadeh,Lorna J Gibson,Eric P Green,Brian J Hall,Liisa Hantsoo,Emily E Haroz,Martin Härter,Ulrich Hegerl,Leanne Hides,Stevan E Hobfoll,Simone Honikman,Marie Hudson,Thomas Hyphantis,Masatoshi Inagaki,Hong Jin Jeon,Nathalie Jetté,Mohammad E Khamseh,Sebastian Köhler,Brandon A Kohrt,Yunxin Kwan,Femke Lamers,Maria Asunción Lara,Holly F Levin-Aspenson,Shen-Ing Liu,Manote Lotrakul,Sonia R Loureiro,Bernd Löwe,Nagendra P Luitel,Crick Lund,Ruth Ann Marrie,Brian P Marx,Sherina Mohd Sidik,Tiago N Munhoz,Kumiko Muramatsu,Juliet E M Nakku,Laura Navarrete,Flávia L Osório,Philippe Persoons,Angelo Picardi,Stephanie L Pugh,Terence J Quinn,Elmars Rancans,Sujit D Rathod,Katrin Reuter,Heather J Rowe,Iná S Santos,Miranda T Schram,Juwita Shaaban,Eileen H Shinn,Lena Spangenberg,Lesley Stafford,Sharon C Sung,Keiko Suzuki,Pei Lin Lynnette Tan,Martin Taylor-Rowan,Thach D Tran,Christina M van der Feltz-Cornelis,Thandi van Heyningen,Henk C van Weert,Lynne I Wagner,Jian Li Wang,David Watson,Karen Wynter,Mitsuhiko Yamada,Qing Zhi Zeng,Yuying Zhang
DOI: https://doi.org/10.1016/j.ymeth.2021.11.005
IF: 4.647
Methods
Abstract:Shortened versions of self-reported questionnaires may be used to reduce respondent burden. When shortened screening tools are used, it is desirable to maintain equivalent diagnostic accuracy to full-length forms. This manuscript presents a case study that illustrates how external data and individual participant data meta-analysis can be used to assess the equivalence in diagnostic accuracy between a shortened and full-length form. This case study compares the Patient Health Questionnaire-9 (PHQ-9) and a 4-item shortened version (PHQ-Dep-4) that was previously developed using optimal test assembly methods. Using a large database of 75 primary studies (34,698 participants, 3,392 major depression cases), we evaluated whether the PHQ-Dep-4 cutoff of ≥ 4 maintained equivalent diagnostic accuracy to a PHQ-9 cutoff of ≥ 10. Using this external validation dataset, a PHQ-Dep-4 cutoff of ≥ 4 maximized the sum of sensitivity and specificity, with a sensitivity of 0.88 (95% CI 0.81, 0.93), 0.68 (95% CI 0.56, 0.78), and 0.80 (95% CI 0.73, 0.85) for the semi-structured, fully structured, and MINI reference standard categories, respectively, and a specificity of 0.79 (95% CI 0.74, 0.83), 0.85 (95% CI 0.78, 0.90), and 0.83 (95% CI 0.80, 0.86) for the semi-structured, fully structured, and MINI reference standard categories, respectively. While equivalence with a PHQ-9 cutoff of ≥ 10 was not established, we found the sensitivity of the PHQ-Dep-4 to be non-inferior to that of the PHQ-9, and the specificity of the PHQ-Dep-4 to be marginally smaller than the PHQ-9.