MGPfact: A Model-Based Factorization Method for scRNA Data Unveils Bifurcating Transcriptional Modules Underlying Cell Fate Determination

Jun Ren, Ying Zhou, Yudi Hu, Jing Yang, Hongkun Fang, Xuejing Lyu, Jintao Guo, Xiaodong Shi, Qiyuan Li
DOI: https://doi.org/10.1101/2024.04.02.587768
2024-10-27
Abstract: Manifold learning is particularly useful for resolving the complex cellular state space from single-cell RNA sequencing (scRNA-seq) data. While current manifold-learning methods provide insights into cell fate by inferring graph-based trajectories at the cell level, challenges remain in retrieving the interpretable biology underlying the diverse cellular states. Here, we describe MGPfact, a model-based manifold-learning framework capable of factorizing complex developmental trajectories into independent bifurcation processes of gene sets, thereby enabling trajectory inference based on relevant features. MGPfact offers a more nuanced understanding of the biological processes underlying cellular trajectories, along with their potential determinants. When benchmarked across 239 datasets, MGPfact showed advantages in major quality-control metrics, such as branch division accuracy and trajectory topology, outperforming most established methods. In real datasets, MGPfact recovered the critical pathways and cell types in microglia development with experimentally validated regulons and markers. Furthermore, MGPfact discovered evolutionary trajectories of tumor-associated CD8 T cells and yielded new subtypes of CD8 T cells with gene expression signatures significantly predictive of responses to immune checkpoint inhibitors in independent cohorts. In summary, MGPfact offers a manifold-learning framework for scRNA-seq data that enables feature selection for specific biological processes and contributes to advancing our understanding of the biological determination of cell fate.
Bioinformatics