ICARUS v3, a massively scalable web server for single cell RNA-seq analysis of millions of cells

Andrew Jiang, Russell G Snell, Klaus Lehnert
DOI: https://doi.org/10.1093/bioinformatics/btae167
IF: 5.8
2024-03-27
Bioinformatics
Abstract:
Motivation: In recent years, improvements in the throughput of single cell RNA-seq have resulted in a significant increase in the number of cells profiled. The generation of single cell RNA-seq datasets comprising >1 million cells is becoming increasingly common, giving rise to demands for more efficient computational workflows.
Results: We present an update to our single cell RNA-seq analysis web server application, ICARUS (available at https://launch.icarus-scrnaseq.cloud.edu.au), that allows effective analysis of large-scale single cell RNA-seq datasets. ICARUS v3 uses the geometric cell sketching method to subsample cells from the overall dataset for dimensionality reduction and clustering; the results can then be projected onto the full dataset. We then extend this functionality to select a representative subset of cells for downstream data analysis applications, including differential expression analysis, gene co-expression network construction, gene regulatory network construction, trajectory analysis, cell-cell communication inference and association of cell clusters with GWAS traits. We demonstrate that ICARUS v3 completes the analysis of single cell RNA-seq datasets of 1.3 million cells within the hour.
Availability: ICARUS is available at https://launch.icarus-scrnaseq.cloud.edu.au.
Supplementary information: Supplementary data are available at Bioinformatics online.
biochemical research methods, biotechnology & applied microbiology, mathematical & computational biology
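
The sketch-then-project idea described in the abstract can be illustrated with a small Python example. This is a simplified sketch under stated assumptions, not ICARUS's implementation and not the published geometric sketching algorithm: it covers a low-dimensional embedding with a fixed grid of equal-sized boxes, draws cells round-robin from the occupied boxes so that sparse regions are not swamped by dense ones, and then transfers labels from the sketched cells to every cell by nearest neighbour. The function names `grid_sketch` and `project_labels`, the `n_bins` parameter, and the toy data are all hypothetical.

```python
import numpy as np

def grid_sketch(X, n_sketch, n_bins=10, seed=0):
    """Return n_sketch row indices of X, spread evenly over occupied grid boxes."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    if n_sketch >= len(X):
        return np.arange(len(X))
    # Cover the data with equal-sized hypercubes (one side length for all axes).
    lo = X.min(axis=0)
    side = (X.max(axis=0) - lo).max() / n_bins
    if side == 0:
        side = 1.0
    cells = np.minimum(((X - lo) / side).astype(int), n_bins - 1)
    # Group cell indices by box; visiting cells in random order shuffles each box.
    boxes = {}
    for i in rng.permutation(len(X)):
        boxes.setdefault(tuple(cells[i]), []).append(int(i))
    boxes = list(boxes.values())
    # Round-robin: take one cell per occupied box per pass until the quota is met.
    chosen, depth = [], 0
    while len(chosen) < n_sketch:
        for b in rng.permutation(len(boxes)):
            if depth < len(boxes[b]) and len(chosen) < n_sketch:
                chosen.append(boxes[b][depth])
        depth += 1
    return np.array(chosen)

def project_labels(X, sketch_idx, sketch_labels):
    """Give every row of X the label of its nearest sketched row."""
    X = np.asarray(X, dtype=float)
    S = X[sketch_idx]
    # Full distance matrix: fine for a toy example, would need chunking at scale.
    d2 = ((X[:, None, :] - S[None, :, :]) ** 2).sum(axis=2)
    return np.asarray(sketch_labels)[d2.argmin(axis=1)]

# Toy data (hypothetical): one abundant population and one rare, distant one.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 1.0, size=(2000, 2)),
               rng.normal(20.0, 1.0, size=(50, 2))])
labels = np.array([0] * 2000 + [1] * 50)

idx = grid_sketch(X, n_sketch=100)
# Stand-in for clustering the sketch: reuse the known labels of the sketched cells.
projected = project_labels(X, idx, labels[idx])
print("rare cells in sketch:", int((labels[idx] == 1).sum()))
print("cells labelled correctly:", int((projected == labels).sum()), "of", len(X))
```

In a real workflow the labels of the sketched cells would come from clustering the subsample rather than from known ground truth, and the nearest-neighbour search would use an approximate or chunked method to stay within memory for millions of cells.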