Alzheimer's Disease Sequencing Project Release 4 Whole Genome Sequencing Dataset
Yuk Yee Leung,Wan-Ping Lee,Amanda B Kuzma,Heather I Nicaretta,Otto Valladares,Prabhakaran Gangadharan,Liming Qu,Yi Zhao,Ren Youli,Po-Liang Cheng,Pavel P Kuksa,Hui Wang,Heather White,Zivadin Katanic,Lauren Bass,Naveen Saravanan,Emily Greenfest-Allen Greenfest-Allen,Maureen Kirsch,Laura B Cantwell,Taha Iqbal,Nicholas R Wheeler,John J Farrer,Congcong Zhu,Shannon L Turner,Tamil Iniyan Gunasekaran,Pedro R Mena,Jimmy Jin,Luke Carter,Alzheimer's Disease Sequencing Project,Xiaoling Zhang,Badri N Vardarajan,Arthur W Toga,Michael Cuccaro,Timothy J Hohman,William S Bush,Adam C Naj,Eden Martin,Clifton Dalgard,Brian W Kunkle,Lindsay A Farrer,Richard P Mayeux,Jonathan L Haines,Margaret A Pericak-Vance,Gerard D Schellenberg,Li-San Wang
DOI: https://doi.org/10.1101/2024.12.03.24317000
2024-12-06
Abstract:The Alzheimer's Disease Sequencing Project (ADSP) is a national initiative to understand the genetic architecture of Alzheimer's Disease and Related Dementias (AD/ADRD) by sequencing whole genomes of affected participants and age-matched cognitive controls from diverse populations. The Genome Center for Alzheimer's Disease (GCAD) processed whole-genome sequencing data from 36,361 ADSP participants, including 35,014 genetically unique participants of which 45% are from non-European ancestry, across 17 cohorts in 14 countries in this fourth release (R4). This sequencing effort identified 387 million bi-allelic variants, 42 million short insertions/deletions, and 2.2 million structural variants. Annotations and quality control data are available for all variants and samples. Additionally, detailed phenotypes from 15,927 participants across 10 domains are also provided. A linkage disequilibrium panel was created using unrelated AD cases and controls. Researchers can access and analyze the genetic data via NIAGADS Data Sharing Service, the VariXam tool, or NIAGADS GenomicsDB.