Data Resource Profile: Whole Blood DNA Methylation Resource in Generation Scotland (MeGS)

Rosie M. Walker, Daniel L. McCartney, Kevin Carr, Michael Barber, Xueyi Shen, Archie Campbell, Elena Bernabeu, Emma Aitken, Angie Fawkes, Nicola Wrobel, Lee Murphy, Heather C. Whalley, David M. Howard, Mark J. Adams, Konrad Rawlik, Pau Navarro, Albert Tenesa, Cathie L Sudlow, David J Porteous, Riccardo Marioni, Andrew M. McIntosh, Kathryn L. Evans
DOI: https://doi.org/10.1101/2024.04.30.24306314
2024-05-02
Abstract: We have generated whole-blood DNA methylation profiles from 18,869 Generation Scotland Scottish Family Health Study (GS) participants, resulting in, at the time of writing, the largest single-cohort DNA methylation resource for basic biological and medical research: Methylation in Generation Scotland (MeGS). GS is a community- and family-based cohort, which recruited over 24,000 participants from Scotland between 2006 and 2011. Comprehensive phenotype information, including detailed data on cognitive function, personality traits, and mental health, is available for all participants. The majority (83%) have genome-wide SNP genotype data (Illumina HumanOmniExpressExome-8 array v1.0 and v1.2), and over 97% of GS participants have given consent for health record linkage and re-contact. At baseline, blood-based DNA methylation was characterised at ∼850,000 sites across four batches using the Illumina EPICv1 array. MeGS participants were aged between 17 and 99 years at the time of enrolment to GS. Blood-based DNA methylation EPICv1 array profiles collected at a follow-up appointment that took place 4.3-12.2 years (mean=7.1 years) after baseline are also available for 796 MeGS participants. Access to MeGS for researchers in the UK and international collaborators is via application to the GS Access Committee.
Genetic and Genomic Medicine
What problem does this paper attempt to address?
The paper describes the establishment of Methylation in Generation Scotland (MeGS), a whole-blood DNA methylation resource built on the Generation Scotland Scottish Family Health Study (GS). It aims to integrate whole-blood DNA methylation data with rich phenotypic, genetic, and electronic health record linkage information for basic biological and medical research. At the time of writing, MeGS is the largest single-cohort DNA methylation resource, consisting of baseline data from 18,869 participants and follow-up data from 796 of them, collected 4.3-12.2 years (mean 7.1 years) after baseline. DNA methylation was measured at approximately 850,000 sites using the Illumina EPICv1 array. MeGS participants were aged 17 to 99 years at enrolment and represent a wide range of socio-economic backgrounds. Detailed data on cognitive function, personality traits, and mental health are available for all participants, alongside data on health behaviors. The majority (83%) have genome-wide SNP genotype data, and over 97% of GS participants consented to health record linkage and re-contact. In addition, MeGS can be combined with other multi-omic data, such as proteomics and metabolomics, for research on disease prevention, detection, and monitoring. The paper also discusses data collection methods, including data capture at baseline and follow-up, sample processing, DNA methylation analysis, and quality control procedures. Extensive clinical and biochemical data were obtained by linking with the Scottish National Health Service records. The study highlights the potential applications of MeGS in mental health, complex traits, and disease mechanisms, such as identifying associations with personality traits through genome-wide association studies (GWAS), predicting health outcomes, validating proxies for blood protein levels, and studying biomarkers of biological aging. The main strengths of MeGS are its large-scale cohort, extensive phenotypic information, long-term follow-up data, and health record linkage.
However, the characteristics of participants, such as age, BMI, and socio-economic status, may influence the representativeness of the data. Despite these limitations, MeGS remains a powerful resource for understanding the biological mechanisms behind complex traits and diseases, as well as for developing clinical and research biomarkers.