CryptoCEN: A Co-Expression Network for Cryptococcus neoformans reveals novel proteins involved in DNA damage repair
Matthew J O'Meara,Jackson R Rapala,Connie B Nichols,A Christina Alexandre,R Blake Billmyre,Jacob L Steenwyk,J Andrew Alspaugh,Teresa R O'Meara,Matthew J. O’Meara,Jackson R. Rapala,Connie B. Nichols,A. Christina Alexandre,R. Blake Billmyre,J. Andrew Alspaugh,Teresa R. O’Meara
DOI: https://doi.org/10.1371/journal.pgen.1011158
IF: 4.5
2024-02-16
PLoS Genetics
Abstract:Elucidating gene function is a major goal in biology, especially among non-model organisms. However, doing so is complicated by the fact that molecular conservation does not always mirror functional conservation, and that complex relationships among genes are responsible for encoding pathways and higher-order biological processes. Co-expression, a promising approach for predicting gene function, relies on the general principal that genes with similar expression patterns across multiple conditions will likely be involved in the same biological process. For Cryptococcus neoformans , a prevalent human fungal pathogen greatly diverged from model yeasts, approximately 60% of the predicted genes in the genome lack functional annotations. Here, we leveraged a large amount of publicly available transcriptomic data to generate a C . neoformans Co-Expression Network (CryptoCEN), successfully recapitulating known protein networks, predicting gene function, and enabling insights into the principles influencing co-expression. With 100% predictive accuracy, we used CryptoCEN to identify 13 new DNA damage response genes, underscoring the utility of guilt-by-association for determining gene function. Overall, co-expression is a powerful tool for uncovering gene function, and decreases the experimental tests needed to identify functions for currently under-annotated genes. A central problem in genetics is the connection between genotype and phenotype. Computational approaches to predict gene function can be especially useful for non-model organisms where extensive functional testing has not yet been performed. Co-expression to predict gene function is based on the principle that genes that share similar expression patterns across multiple environmental conditions or perturbations are likely to be involved in the same biological process. Here, we collected transcriptomic data from the Cryptococcus neoformans field and built a robust co-expression network for predicting gene function, especially biological process information. Not only are we able to use this network for retrospective analysis of known gene clusters, but we are also able to make prospective predictions about gene function, including the well-studied processes of capsule and ergosterol biosynthesis. We also discovered a new role for 13 genes in the response to DNA damaging agents, showing that co-expression can reveal new players in conserved biological processes.
genetics & heredity