Intersecting single-cell transcriptomics and genome-wide association studies identifies crucial cell populations and candidate genes for atherosclerosis

Lotte Slenders, Lennart P L Landsmeer, Kai Cui, Marie A C Depuydt, Maarten Verwer, Joost Mekke, Nathalie Timmerman, Noortje A M van den Dungen, Johan Kuiper, Menno P J de Winther, Koen H M Prange, Wei Feng Ma, Clint L Miller, Redouane Aherrahrou, Mete Civelek, Gert J de Borst, Dominique P V de Kleijn, Folkert W Asselbergs, Hester M den Ruijter, Arjan Boltjes, Gerard Pasterkamp, Sander W van der Laan, Michal Mokry
DOI: https://doi.org/10.1093/ehjopen/oeab043
2021-12-21
European Heart Journal Open
Abstract:
Aims: Genome-wide association studies (GWASs) have discovered hundreds of common genetic variants for atherosclerotic disease and cardiovascular risk factors. The translation of susceptibility loci into biological mechanisms and targets for drug discovery remains challenging. Intersecting genetic and gene expression data has led to the identification of candidate genes. However, previously studied tissues are often non-diseased and heterogeneous in cell composition, hindering accurate candidate prioritization. Therefore, we analysed single-cell transcriptomics from atherosclerotic plaques for cell-type-specific expression to identify atherosclerosis-associated candidate gene–cell pairs.
Methods and results: We applied gene-based analyses using GWAS summary statistics from 46 atherosclerotic and cardiovascular disease, risk factors, and other traits. We then intersected these candidates with single-cell RNA sequencing (scRNA-seq) data to identify genes specific for individual cell (sub)populations in atherosclerotic plaques. The coronary artery disease (CAD) loci demonstrated a prominent signal in plaque smooth muscle cells (SMCs; SKI, KANK2, and SORT1; P-adj. = 0.0012) and endothelial cells (ECs; SLC44A1 and ATP2B1; P-adj. = 0.0011). Finally, we used liver-derived scRNA-seq data and showed hepatocyte-specific enrichment of genes involved in serum lipid levels.
Conclusion: We discovered novel and known gene–cell pairs pointing to new biological mechanisms of atherosclerotic disease. We highlight that loci associated with CAD reveal prominent association levels in mainly plaque SMC and EC populations. We present an intuitive single-cell transcriptomics-driven workflow rooted in human large-scale genetic studies to identify putative candidate genes and affected cells associated with cardiovascular traits.
Collectively, our workflow allows for the identification of cell-specific targets relevant for atherosclerosis and can be universally applied to other complex genetic diseases and traits.
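The intersection step described under Methods and results can be illustrated with a minimal sketch. It assumes a hypergeometric over-representation test followed by Benjamini-Hochberg adjustment; the abstract does not state which statistical test the authors used, so this is one common choice rather than their method. The function names, the marker sets, the `HYPO*` placeholder genes, and the background size of 1000 are all hypothetical toy values, not data from the paper.

```python
from math import comb


def hypergeom_sf(overlap, background, n_markers, n_candidates):
    """P(X >= overlap) when n_candidates genes are drawn from a background
    of `background` genes that contains n_markers marker genes."""
    total = comb(background, n_candidates)
    upper = min(n_markers, n_candidates)
    hits = sum(
        comb(n_markers, k) * comb(background - n_markers, n_candidates - k)
        for k in range(overlap, upper + 1)
    )
    return hits / total


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so monotonicity is enforced.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def enrich(candidates, markers_by_cell, background_size):
    """Intersect GWAS candidate genes with per-cell-type marker genes.

    Returns {cell_type: (shared_genes, p_value, adjusted_p_value)}.
    Assumes every candidate and marker gene belongs to the background.
    """
    cells = list(markers_by_cell)
    shared = [candidates & markers_by_cell[c] for c in cells]
    pvals = [
        hypergeom_sf(len(s), background_size,
                     len(markers_by_cell[c]), len(candidates))
        for c, s in zip(cells, shared)
    ]
    padj = benjamini_hochberg(pvals)
    return {c: (sorted(s), p, q)
            for c, s, p, q in zip(cells, shared, pvals, padj)}


if __name__ == "__main__":
    # Toy input: candidate genes named in the abstract, made-up marker sets.
    gwas_candidates = {"SKI", "KANK2", "SORT1", "SLC44A1", "ATP2B1"}
    toy_markers = {
        "SMC": {"SKI", "KANK2", "SORT1", "HYPO1", "HYPO2"},
        "EC": {"SLC44A1", "ATP2B1", "HYPO3"},
        "T cell": {"HYPO4", "HYPO5"},
    }
    for cell, (genes, p, q) in enrich(gwas_candidates, toy_markers, 1000).items():
        print(cell, genes, f"p={p:.3g}", f"p_adj={q:.3g}")
```

In this sketch the background is every gene that could have been selected in both data sets; choosing it too broadly or too narrowly changes the p-values, so in a real analysis it would need to match the genes actually tested.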