Assessing Risk Prediction Models Using Individual Participant Data from Multiple Studies
Lisa Pennells,Stephen Kaptoge,Ian R. White,Simon G. Thompson,Angela M. Wood,Robert W. Tipping,Aaron R. Folsom,David J. Couper,Christie M. Ballantyne,Josef Coresh,S. Goya Wannamethee,Richard W. Morris,Stefan Kiechl,Johann Willeit,Peter Willeit,Georg Schett,Shah Ebrahim,Debbie A. Lawlor,John W. Yarnell,John Gallacher,Mary Cushman,Bruce M. Psaty,Russ Tracy,Anne Tybjærg-Hansen,Jackie F. Price,Amanda J. Lee,Stela McLachlan,Kay-Tee Khaw,Nicholas J. Wareham,Hermann Brenner,Ben Schöttker,Heiko Müller,Jan-Håkan Jansson,Patrik Wennberg,Veikko Salomaa,Kennet Harald,Pekka Jousilahti,Erkki Vartiainen,Mark Woodward,Ralph B. D'Agostino,Else-Marie Bladbjerg,Torben Jørgensen,Yutaka Kiyohara,Hisatomi Arima,Yasufumi Doi,Toshiharu Ninomiya,Jacqueline M. Dekker,Giel Nijpels,Coen D. A. Stehouwer,Jussi Kauhanen,Jukka T. Salonen,Tom W. Meade,Jackie A. Cooper,Steven Shea,Angela Döring,Lewis H. Kuller,Greg Grandits,Richard F. Gillum,Michael Mussolino,Eric B. Rimm,Sue E. Hankinson,JoAnn E. Manson,Jennifer K. Pai,Susan Kirkland,Jonathan A. Shaffer,Daichi Shimbo,Stephan J. L. Bakker,Ron T. Gansevoort,Hans L. Hillege,Philippe Amouyel,Dominique Arveiler,Alun Evans,Jean Ferrières,Naveed Sattar,Rudi G. Westendorp,Brendan M. Buckley,Bernard Cantin,Benoît Lamarche,Elizabeth Barrett-Connor,Deborah L. Wingard,Richele Bettencourt,Vilmundur Gudnason,Thor Aspelund,Gunnar Sigurdsson,Bolli Thorsson,Maryam Kavousi,Jacqueline C. Witteman,Albert Hofman,Oscar H. Franco,Barbara V. Howard,Ying Zhang,Lyle Best,Jason G. Umans,Altan Onat,Johan Sundström,J. Michael Gaziano,Meir Stampfer,Paul M. Ridker,Michael Marmot,Robert Clarke,Rory Collins,Astrid Fletcher,Eric Brunner,Martin Shipley,Mika Kivimäki,Julie Buring,Nancy Cook,Ian Ford,James Shepherd,Stuart M. Cobbe,Michele Robertson,Matthew Walker,Sarah Watson,Myriam Alexander,Adam S. Butterworth,Emanuele Di Angelantonio,Pei Gao,Philip Haycock,David Wormser,John Danesh
DOI: https://doi.org/10.1093/aje/kwt298
2013-01-01
American Journal of Epidemiology
Abstract:Individual participant time-to-event data from multiple prospective epidemiologic studies enable detailed investigation into the predictive ability of risk models. Here we address the challenges in appropriately combining such information across studies. Methods are exemplified by analyses of log C-reactive protein and conventional risk factors for coronary heart disease in the Emerging Risk Factors Collaboration, a collation of individual data from multiple prospective studies with an average follow-up duration of 9.8 years (dates varied). We derive risk prediction models using Cox proportional hazards regression analysis stratified by study and obtain estimates of risk discrimination, Harrell's concordance index, and Royston's discrimination measure within each study; we then combine the estimates across studies using a weighted meta-analysis. Various weighting approaches are compared and lead us to recommend using the number of events in each study. We also discuss the calculation of measures of reclassification for multiple studies. We further show that comparison of differences in predictive ability across subgroups should be based only on within-study information and that combining measures of risk discrimination from case-control studies and prospective studies is problematic. The concordance index and discrimination measure gave qualitatively similar results throughout. While the concordance index was very heterogeneous between studies, principally because of differing age ranges, the increments in the concordance index from adding log C-reactive protein to conventional risk factors were more homogeneous.