Tools for the Analysis of High-Dimensional Single-Cell RNA Sequencing Data

Wu, Yan,Zhang, Kun
DOI: https://doi.org/10.1038/s41581-020-0262-0
IF: 42.439
2020-01-01
Nature Reviews Nephrology
Abstract:Breakthroughs in the development of high-throughput technologies for profiling transcriptomes at the single-cell level have helped biologists to understand the heterogeneity of cell populations, disease states and developmental lineages. However, these single-cell RNA sequencing (scRNA-seq) technologies generate an extraordinary amount of data, which creates analysis and interpretation challenges. Additionally, scRNA-seq datasets often contain technical sources of noise owing to incomplete RNA capture, PCR amplification biases and/or batch effects specific to the patient or sample. If not addressed, this technical noise can bias the analysis and interpretation of the data. In response to these challenges, a suite of computational tools has been developed to process, analyse and visualize scRNA-seq datasets. Although the specific steps of any given scRNA-seq analysis might differ depending on the biological questions being asked, a core workflow is used in most analyses. Typically, raw sequencing reads are processed into a gene expression matrix that is then normalized and scaled to remove technical noise. Next, cells are grouped according to similarities in their patterns of gene expression, which can be summarized in two or three dimensions for visualization on a scatterplot. These data can then be further analysed to provide an in-depth view of the cell types or developmental trajectories in the sample of interest. This Review provides the non-expert reader with an overview of the different steps involved in the analysis of single-cell RNA sequencing data. The authors also provide insight into the strengths and pitfalls of available analysis tools.
What problem does this paper attempt to address?