CellMarkerPipe: cell marker identification and evaluation pipeline in single cell transcriptomes

Yinglu Jia,Pengchong Ma,Qiuming Yao
DOI: https://doi.org/10.1038/s41598-024-63492-z
IF: 4.6
2024-06-09
Scientific Reports
Abstract:Assessing marker genes from all cell clusters can be time-consuming and lack systematic strategy. Streamlining this process through a unified computational platform that automates identification and benchmarking will greatly enhance efficiency and ensure a fair evaluation. We therefore developed a novel computational platform, cellMarkerPipe (https://github.com/yao-laboratory/cellMarkerPipe), for automated cell-type specific marker gene identification from scRNA-seq data, coupled with comprehensive evaluation schema. CellMarkerPipe adaptively wraps around a collection of commonly used and state-of-the-art tools, including Seurat, COSG, SC3, SCMarker, COMET, and scGeneFit. From rigorously testing across diverse samples, we ascertain SCMarker's overall reliable performance in single marker gene selection, with COSG showing commendable speed and comparable efficacy. Furthermore, we demonstrate the pivotal role of our approach in real-world medical datasets. This general and opensource pipeline stands as a significant advancement in streamlining cell marker gene identification and evaluation, fitting broad applications in the field of cellular biology and medical research.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem this paper attempts to address is the time-consuming and lack of systematic strategies in the identification and evaluation process of cell marker genes in single-cell transcriptome data. Specifically, evaluating marker genes from all cell clusters usually requires manual checking of information in literature or cell marker databases, which is not only time-consuming but also prone to bias due to the application of different methods. Therefore, the authors developed a new computational platform **cellMarkerPipe** for the automated identification and evaluation of cell type-specific marker genes in single-cell transcriptome data. This platform integrates various commonly used and cutting-edge tools, such as Seurat, COSG, SC3, SCMarker, COMET, and scGeneFit, and has been rigorously tested to verify its reliability and effectiveness in multiple samples. Additionally, the application of this platform in actual medical datasets also demonstrates its importance. Overall, cellMarkerPipe aims to simplify the identification and evaluation process of cell marker genes, improve efficiency, and ensure fair evaluation, making it suitable for a wide range of fields in cell biology and medical research.