Clover: An unbiased method for prioritizing differentially expressed genes using a data‐driven approach

Gina Miku Oba,Ryuichiro Nakato
DOI: https://doi.org/10.1111/gtc.13119
2024-04-12
Genes to Cells
Abstract:We developed Clover (https://github.com/G708/Clover), a data‐driven scoring method to specifically highlight the understudied genes. Clover aims to prioritize genes associated with important molecular mechanisms by integrating three metrics: the likelihood of appearing in the DEG list, tissue specificity, and number of publications. Our method offers a novel approach to gene characterization and has the potential to expand our understanding of gene functions. Identifying key genes from a list of differentially expressed genes (DEGs) is a critical step in transcriptome analysis. However, current methods, including Gene Ontology analysis and manual annotation, essentially rely on existing knowledge, which is highly biased depending on the extent of the literature. As a result, understudied genes, some of which may be associated with important molecular mechanisms, are often ignored or remain obscure. To address this problem, we propose Clover, a data‐driven scoring method to specifically highlight understudied genes. Clover aims to prioritize genes associated with important molecular mechanisms by integrating three metrics: the likelihood of appearing in the DEG list, tissue specificity, and number of publications. We applied Clover to Alzheimer's disease data and confirmed that it successfully detected known associated genes. Moreover, Clover effectively prioritized understudied but potentially druggable genes. Overall, our method offers a novel approach to gene characterization and has the potential to expand our understanding of gene functions. Clover is an open‐source software written in Python3 and available on GitHub at https://github.com/G708/Clover.
cell biology,genetics & heredity
What problem does this paper attempt to address?