Extraction of biological signals by factorization enables the reliable analysis of single-cell transcriptomics

Feng Zeng,Xuwen Kong,Fan Yang,Ting Chen,Jiahuai Han
DOI: https://doi.org/10.1101/2023.03.04.531126
2023-01-01
Abstract:Accurately and reliably capturing actual biological signals from single-cell transcriptomics is vital for achieving legitimate scientific results, which is unfortunately hindered by the presence of various kinds of unwanted variations. Here we described a deep auto-regressive factor model known as scPhenoXMBD, demonstrated that each gene’s expression can be split into discrete components that represent biological signals and unwanted variations, which effectively mitigated the effects of unwanted variations in the data of single-cell sequencing. Using scPhenoXMBD, we evaluated various factors affecting IFN β -stimulated immune cells and demonstrated that biological signal extraction facilitates the identification of IFN β -responsive pathways and genes. Numerous experiments were conducted to show that scPhenoXMBD could be utilized successfully in enhancing cell clustering stability, obtaining identical cell populations from diverse data sources, advancing the single-cell CRISPR screening of functional elements, and minimizing the influence of inter-subject discrepancies in the cell-disease relationships. scPhenoXMBD is anticipated to be a dependable and repeatable method for the precise analysis of single-cell data. ### Competing Interest Statement The authors have declared no competing interest.
What problem does this paper attempt to address?