GOWDL: gene ontology-driven wide and deep learning model for cell typing of scRNA-seq data

Antonino Fiannaca,Massimo La Rosa,Laura La Paglia,Salvatore Gaglio,Alfonso Urso
DOI: https://doi.org/10.1093/bib/bbad332
IF: 9.5
2023-09-22
Briefings in Bioinformatics
Abstract:Abstract Single-cell RNA-sequencing (scRNA-seq) allows for obtaining genomic and transcriptomic profiles of individual cells. That data make it possible to characterize tissues at the cell level. In this context, one of the main analyses exploiting scRNA-seq data is identifying the cell types within tissue to estimate the quantitative composition of cell populations. Due to the massive amount of available scRNA-seq data, automatic classification approaches for cell typing, based on the most recent deep learning technology, are needed. Here, we present the gene ontology-driven wide and deep learning (GOWDL) model for classifying cell types in several tissues. GOWDL implements a hybrid architecture that considers the functional annotations found in Gene Ontology and the marker genes typical of specific cell types. We performed cross-validation and independent external testing, comparing our algorithm with 12 other state-of-the-art predictors. Classification scores demonstrated that GOWDL reached the best results over five different tissues, except for recall, where we got about 92% versus 97% of the best tool. Finally, we presented a case study on classifying immune cell populations in breast cancer using a hierarchical approach based on GOWDL.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?