Out-of-distribution Prediction with Disentangled Representations for Single-Cell RNA Sequencing Data

Mohammad Lotfollahi,Leander Dony,Harshita Agarwala,Fabian J. Theis
DOI: https://doi.org/10.1101/2021.09.01.458535
2021-01-01
Abstract:Learning robust representations can help uncover underlying biological variation in scRNA-seq data. Disentangled representation learning is one approach to obtain such informative as well as interpretable representations. Here, we learn disentangled representations of scRNA-seq data using β variational autoencoder (β-VAE) and apply the model for out-of-distribution (OOD) prediction. We demonstrate accurate gene expression predictions for cell-types absent from training in a perturbation and a developmental dataset. We further show that β-VAE outperforms a state-of-the-art disentanglement method for scRNA-seq in OOD prediction while achieving better disentanglement performance.
What problem does this paper attempt to address?