Unique Signatures of Highly Constrained Genes Across Publicly Available Genomic Databases

Klaus Schmitz-Abe,Qifei Li,Sunny Greene,Michela Borrelli,Shiyu Luo,Madesh C. Ramesh,Pankaj B. Agrawal
DOI: https://doi.org/10.1101/2024.09.05.611529
2024-09-05
Abstract:Publicly available genomic databases and genetic constraint scores are crucial in understanding human population variation and the identification of variants that are likely to have a deleterious impact causing human disease. We utilized the one of largest publicly available databases, gnomAD, to determine genes that are highly constrained for only LoF, only missense, and both LoF/missense variants, identified their unique signatures, and explored their causal relationship with human conditions. Those genes were evaluated for unique patterns including their chromosomal location, tissue level expression, gene ontology analysis, and gene family categorization using multiple publicly available databases. Those highly constrained genes associated with human disease, we identified unique patterns of inheritance, protein size, and enrichment in distinct molecular pathways. In addition, we identified a cohort of highly constrained genes that are currently not known to cause human disease, that we suggest will be candidates to pursue as novel disease-associated genes. In summary, these insights not only elucidate biological pathways of highly constrained genes that expand our understanding of critical cellular proteins but also advance research in rare diseases.
Genetics
What problem does this paper attempt to address?