Machine learning for single cell genomics data analysis

Félix Raimundo,Laetitia Papaxanthos,Céline Vallot,Jean-Philippe Vert
DOI: https://doi.org/10.1101/2021.02.04.429763
2021-02-05
Abstract:Abstract Single-cell omics technologies produce large quantities of data describing the genomic, transcriptomic or epigenomic profiles of many individual cells in parallel. In order to infer biological knowledge and develop predictive models from these data, machine learning (ML)-based model are increasingly used due to their flexibility, scalability, and impressive success in other fields. In recent years, we have seen a surge of new ML-based method development for low-dimensional representations of single-cell omics data, batch normalization, cell type classification, trajectory inference, gene regulatory network inference or multimodal data integration. To help readers navigate this fast-moving literature, we survey in this review recent advances in ML approaches developed to analyze single-cell omics data, focusing mainly on peer-reviewed publications published in the last two years (2019-2020).
What problem does this paper attempt to address?