CWAS-Plus: Estimating category-wide association of rare noncoding variation from whole-genome sequencing data with cell-type-specific functional data

Yujin Kim,Minwoo Jeong,In Gyeong Koh,Chanhee Kim,Hyeji Lee,Jae Hyun Kim,Ronald Yurko,Il Bin Kim,Jeongbin Park,Donna M. Werling,Stephan J. Sanders,Joon-Yong An
DOI: https://doi.org/10.1101/2024.04.15.24305828
2024-04-15
Abstract:Variants in cis-regulatory elements link the noncoding genome to human brain pathology; however, detailed analytic tools for understanding the association between cell-level brain pathology and noncoding variants are lacking. CWAS-Plus, adapted from a Python package for category-wide association testing (CWAS) employs both whole-genome sequencing and user-provided functional data to enhance noncoding variant analysis, with a faster and more efficient execution of the CWAS workflow. Here, we used single-nuclei assay for transposase-accessible chromatin with sequencing to facilitate CWAS-guided noncoding variant analysis at cell-type specific enhancers and promoters. Examining autism spectrum disorder whole-genome sequencing data (n = 7,280), CWAS-Plus identified noncoding variant associations in transcription factor binding sites within conserved loci. Independently, in Alzheimer’s disease whole-genome sequencing data (n = 1,087), CWAS-Plus detected rare noncoding variant associations in microglia-specific regulatory elements. These findings highlight CWAS-Plus’s utility in genomic disorders and scalability for processing large-scale whole-genome sequencing data and in multiple-testing corrections. CWAS-Plus and its user manual are available at and , respectively.
Genetic and Genomic Medicine
What problem does this paper attempt to address?