The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource
Elliot Sollis,Abayomi Mosaku,Ala Abid,Annalisa Buniello,Maria Cerezo,Laurent Gil,Tudor Groza,Osman Güneş,Peggy Hall,James Hayhurst,Arwa Ibrahim,Yue Ji,Sajo John,Elizabeth Lewis,Aoife McMahon,David Osumi-Sutherland,Kalliope Panoutsopoulou,Zoë Pendlington,Santhi Ramachandran,Ray Stefancsik,Jonathan Stewart,Patricia Whetzel,Robert Wilson,Lucia Hindorff,Fiona Cunningham,Michael Inouye,Helen Parkinson,Jacqueline A L MacArthur,Samuel A Lambert,Laura W Harris
DOI: https://doi.org/10.1093/nar/gkac1010
IF: 14.9
2022-11-09
Nucleic Acids Research
Abstract:Abstract The NHGRI-EBI GWAS Catalog (www.ebi.ac.uk/gwas) is a FAIR knowledgebase providing detailed, structured, standardised and interoperable genome-wide association study (GWAS) data to >200 000 users per year from academic research, healthcare and industry. The Catalog contains variant-trait associations and supporting metadata for >45 000 published GWAS across >5000 human traits, and >40 000 full P-value summary statistics datasets. Content is curated from publications or acquired via author submission of prepublication summary statistics through a new submission portal and validation tool. GWAS data volume has vastly increased in recent years. We have updated our software to meet this scaling challenge and to enable rapid release of submitted summary statistics. The scope of the repository has expanded to include additional data types of high interest to the community, including sequencing-based GWAS, gene-based analyses and copy number variation analyses. Community outreach has increased the number of shared datasets from under-represented traits, e.g. cancer, and we continue to contribute to awareness of the lack of population diversity in GWAS. Interoperability of the Catalog has been enhanced through links to other resources including the Polygenic Score Catalog and the International Mouse Phenotyping Consortium, refinements to GWAS trait annotation, and the development of a standard format for GWAS data.
biochemistry & molecular biology