Uncovering the Key Dimensions of High-Throughput Biomolecular Data Using Deep Learning.

Shixiong Zhang,Xiangtao Li,Qiuzhen Lin,Jiecong Lin,Ka-Chun Wong
DOI: https://doi.org/10.1093/nar/gkaa191
IF: 14.9
2020-01-01
Nucleic Acids Research
Abstract:Recent advances in high-throughput single-cell RNA-seq have enabled us to measure thousands of gene expression levels at single-cell resolution. However, the transcriptomic profiles are high-dimensional and sparse in nature. To address it, a deep learning framework based on auto-encoder, termed DeepAE, is proposed to elucidate high-dimensional transcriptomic profiling data in an encode-decode manner. Comparative experiments were conducted on nine transcriptomic profiling datasets to compare DeepAE with four benchmark methods. The results demonstrate that the proposed DeepAE outperforms the benchmark methods with robust performance on uncovering the key dimensions of single-cell RNA-seq data. In addition, we also investigate the performance of DeepAE in other contexts and platforms such as mass cytometry and metabolic profiling in a comprehensive manner. Gene ontology enrichment and pathology analysis are conducted to reveal the mechanisms behind the robust performance of DeepAE by uncovering its key dimensions.
What problem does this paper attempt to address?