“Big Data” in Alzheimer’s Disease Research: an Environmental Scan

Y. Hong, M. K. Pickering, E. M. Perfetto, J. Albrecht, B. Ung, K. Yang, H. Lederer
DOI: https://doi.org/10.1016/j.jval.2015.09.2569
IF: 5.156
2015-01-01
Value in Health
Abstract: Repositories of “big data” have the potential to play a pivotal role in advancing Alzheimer’s disease (AD) research. Globally, AD-specific data are being generated and aggregated into research databases. An environmental scan was conducted to identify AD-specific databases worldwide and the types of data being aggregated and used for AD research. A Google search was conducted monthly from September 2014 to the present. For each database, the URL, geographic location, funding source, and type of data collected and/or stored were identified. A categorization scheme was established to classify types of data. Three reviewers independently categorized the data as: 1 = claims, 2 = laboratory, 3 = genetic, 4 = imaging, 5 = patient/caregiver-reported questionnaires, 6a = longitudinal study, 6b = patient registry, 7 = clinical data, 8 = electronic medical records, 9 = neuropathology, and O = other. A fourth reviewer resolved discrepancies. A total of 53 AD databases were identified, both within (28/53) and outside (21/53) the U.S. Sources outside the U.S. include the United Kingdom, Australia, Belgium, and France, among others. Four databases represent U.S. and non-U.S. collaborations. The National Institutes of Health is the most common funding source (14/53). Clinical data were the most common data type (30/53), whereas databases containing AD-specific claims data appear to be lacking. Additional gaps include the lack of a comprehensive database linking claims data with patient-level data from AD longitudinal studies, patient registries, electronic medical records, or genetic data. Patient registry databases lack pre-diagnosis and early-life data because they enroll patients upon diagnosis with AD or mild cognitive impairment. Various types of data are being aggregated into numerous AD-specific research databases worldwide. However, gaps exist that may limit the utility of these databases for advancing AD research. Efforts are needed to explore opportunities to merge and expand these databases to fill these critical gaps.
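
The categorization step described in the abstract (three independent reviewers, with a fourth resolving discrepancies) can be illustrated with a minimal Python sketch. The category codes are taken from the abstract; everything else is an assumption made for illustration, including the function names `needs_adjudication` and `disputed_codes`, the choice to let a database carry more than one code, and the rule that any difference between reviewers counts as a discrepancy. The abstract does not say how the authors recorded or compared reviewer decisions.

```python
from typing import Dict, List, Set

# Category codes exactly as listed in the abstract.
CATEGORIES: Dict[str, str] = {
    "1": "claims",
    "2": "laboratory",
    "3": "genetic",
    "4": "imaging",
    "5": "patient/caregiver-reported questionnaires",
    "6a": "longitudinal study",
    "6b": "patient registry",
    "7": "clinical data",
    "8": "electronic medical records",
    "9": "neuropathology",
    "O": "other",
}


def needs_adjudication(reviews: List[Set[str]]) -> bool:
    """Return True when the reviewers' code sets are not all identical.

    Assumption: each reviewer assigns a set of codes to a database, and any
    difference between the sets is sent to an additional reviewer.
    """
    for codes in reviews:
        unknown = codes - set(CATEGORIES)
        if unknown:
            raise ValueError(f"unknown category codes: {sorted(unknown)}")
    return any(codes != reviews[0] for codes in reviews[1:])


def disputed_codes(reviews: List[Set[str]]) -> Set[str]:
    """Return codes assigned by at least one reviewer but not by all of them."""
    return set.union(*reviews) - set.intersection(*reviews)


# Hypothetical reviewer decisions for one database (not data from the study).
example_reviews = [{"7"}, {"7", "4"}, {"7"}]
if needs_adjudication(example_reviews):
    for code in sorted(disputed_codes(example_reviews)):
        print(f"Send to fourth reviewer: {code} = {CATEGORIES[code]}")
```

Representing each reviewer's decision as a set keeps the comparison independent of the order in which codes were entered, which matters if a single database can hold several data types.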