Single cell variant to enhancer to gene map for coronary artery disease
Junedh M Amrute,Paul C Lee,Ittai Eres,Chang Jie Mick Lee,Andrea Bredemeyer,Maya U Sheth,Tracy Yamawaki,Rijan Gurung,George Anene Nzelu,Wei-Lin Qui,Soumya Kundu,Daniel Y Li,Markus Ramste,Daniel Lu,Anthony Tan,Chul-Joo Kang,Ryan E Wagoner,Arturo Alisio,Paul Cheng,Quanyi Zhao,Clint L Miller,Ira Hall,Rajat M. Gupta,Yi-Hsiang Hsu,Saptarsi M Haldar,Kory J Lavine,Simon Jackson,Robin Andersson,Jesse M Engreitz,Roger SY Foo,Chi-Ming Kevin Li,Brandon Ason,Thomas Quertermous,Nathan O Stitziel
DOI: https://doi.org/10.1101/2024.11.13.24317257
2024-11-13
Abstract:Although genome wide association studies (GWAS) in large populations have identified hundreds of variants associated with common diseases such as coronary artery disease (CAD), most disease-associated variants lie within non-coding regions of the genome, rendering it difficult to determine the downstream causal gene and cell type. Here, we performed paired single nucleus gene expression and chromatin accessibility profiling from 44 human coronary arteries. To link disease variants to molecular traits, we developed a meta-map of 88 samples and discovered 11,182 single-cell chromatin accessibility quantitative trait loci (caQTLs). Heritability enrichment analysis and disease variant mapping demonstrated that smooth muscle cells (SMCs) harbor the greatest genetic risk for CAD. To capture the continuum of SMC cell states in disease, we used dynamic single cell caQTL modeling for the first time in tissue to uncover QTLs whose effects are modified by cell state and expand our insight into genetic regulation of heterogenous cell populations. Notably, we identified a variant in the CAD GWAS locus which becomes a caQTL as SMCs de-differentiate by changing a transcription factor binding site for EGR1/2. To unbiasedly prioritize functional candidate genes, we built a genome-wide single cell variant to enhancer to gene (scV2E2G) map for human CAD to link disease variants to causal genes in cell types. Using this approach, we found several hundred genes predicted to be linked to disease variants in different cell types. Next, we performed genome-wide Hi-C in 16 human coronary arteries to build tissue specific maps of chromatin conformation and link disease variants to integrated chromatin hubs and distal target genes. Using this approach, we show that rs4887091 within the CAD GWAS locus modulates function of a super chromatin interactome through a change in a CTCF binding site. Finally, we used CRISPR interference to validate a distal gene, , liked to a CAD GWAS locus. Collectively we provide a disease-agnostic framework to translate human genetic findings to identify pathologic cell states and genes driving disease, producing a comprehensive scV2E2G map with genetic and tissue level convergence for future mechanistic and therapeutic studies.