Infusing structural assumptions into dimensionality reduction for single-cell RNA sequencing data to identify small gene sets

Maren Hackenberg,Niklas Brunn,Tanja Vogel,Harald Binder
DOI: https://doi.org/10.1101/2024.02.15.580085
2025-01-25
Abstract:Dimensionality reduction greatly facilitates the exploration of cellular heterogeneity in single-cell RNA sequencing data. While most of such approaches are data-driven, it can be useful to incorporate biologically plausible assumptions about the underlying structure or the experimental design. We propose the boosting autoencoder (BAE) approach, which combines the advantages of unsupervised deep learning for dimensionality reduction and boosting for formalizing assumptions. Specifically, our approach selects small sets of genes that explain latent dimensions. As illustrative applications, we explore the diversity of neural cell identities and temporal patterns of embryonic development.
Bioinformatics
What problem does this paper attempt to address?