Characteristic Direction Approach to Identify Differentially Expressed Genes

Neil R. Clark,Kevin Hu,Edward Y. Chen,Qioanan Duan,Avi Ma`ayan
DOI: https://doi.org/10.48550/arXiv.1307.8366
2013-08-01
Abstract:Genome-wide gene expression profiles, as measured with microarrays or RNA-Seq experiments, have revolutionized biological and biomedical research by providing a quantitative measure of the entire mRNA transcriptome. Typically, researchers set up experiments where control samples are compared to a treatment condition, and using the t-test they identify differentially expressed genes upon which further analysis and ultimately biological discovery from such experiments is based. Here we describe an alternative geometrical approach to identify differentially expressed genes. We show that this alternative method, called the Characteristic Direction, is capable of identifying more relevant genes. We evaluate our approach in three case studies. In the first two, we match transcription factor targets determined by ChIP-seq profiling with differentially expressed genes after the same transcription factor knockdown or over-expression in mammalian cells. In the third case study, we evaluate the quality of enriched terms when comparing normal epithelial cells with cancer stem cells. In conclusion, we demonstrate that the Characteristic Direction approach is much better in calling the significantly differentially expressed genes and should replace the widely currently in used t-test method for this purpose. Implementations of the method in MATLAB, Python and Mathematica are available at: <a class="link-external link-http" href="http://www.maayanlab.net/CD" rel="external noopener nofollow">this http URL</a>.
Applications,Quantitative Methods
What problem does this paper attempt to address?