Accelerating Single-Cell Sequencing Data Analysis with SciDAP: A User-Friendly Approach

Michael Kotliar,Andrey Kartashov,Artem Barski
DOI: https://doi.org/10.1101/2024.02.28.582604
2024-05-22
Abstract:Single-cell (sc) RNA, ATAC and Multiome sequencing became powerful tools for uncovering biological and disease mechanisms. Unfortunately, manual analysis of sc data presents multiple challenges due to large data volumes and complexity of configuration parameters. This complexity, as well as not being able to reproduce a computational environment, affects the reproducibility of analysis results. The Scientific Data Analysis Platform (https://SciDAP.com) allows biologists without computational expertise to analyze sequencing-based data using portable and reproducible pipelines written in Common Workflow Language (CWL). Our suite of computational pipelines addresses the most common needs in scRNA-Seq, scATAC-Seq and scMultiome data analysis. When executed on SciDAP, it offers a user-friendly alternative to manual data processing, eliminating the need for coding expertise. In this protocol, we describe the use of SciDAP to analyze scMultiome data. Similar approaches can be used for analysis of scRNA-Seq, scATAC-Seq and scVDJ-Seq datasets.
Bioinformatics
What problem does this paper attempt to address?
The paper attempts to address several key issues in single-cell sequencing data analysis: 1. **Large and Complex Data**: Single-cell RNA, ATAC, and multi-omics (Multiome) sequencing technologies have become powerful tools for uncovering biological and disease mechanisms. However, due to the large volume of data and complex configuration parameters, manual analysis of these data presents numerous challenges. 2. **Reproducibility Issues**: When manually processing single-cell data, inconsistent computing environments lead to difficulties in reproducing analysis results. Over time, the software used in the original analysis may become outdated or even deprecated, and replacing it with new versions may cause compatibility issues and result differences. 3. **Programming Skills Requirement**: Manual processing requires programming skills and is prone to errors. To address these issues, the paper introduces SciDAP (Scientific Data Analysis Platform), a user-friendly scientific data analysis platform. SciDAP ensures consistency in computing environments and reproducibility of analysis results through containerization and standardized workflows (using Common Workflow Language, CWL), while providing an intuitive web interface that allows for complex single-cell data analysis without the need for programming experience.