Inferring latent temporal progression and regulatory networks from cross-sectional transcriptomic data of cancer samples

Xiaoqiang Sun,Ji Zhang,Qing Nie
DOI: https://doi.org/10.1371/journal.pcbi.1008379
2021-01-01
PLoS Computational Biology
Abstract:Author summary Reconstructing gene regulatory network (GRN) is an essential question in systems biology. The lack of temporal information in sample-based transcriptomic data leads to a major challenge for inferring GRN and its translation to precision medicine. To address the above challenge, we propose to decode the latent temporal information underlying cancer progression via ordering patient samples based on transcriptomic similarity, and design a latent-temporal progression-based Bayesian method to infer GRNs from sample-based transcriptomic data of cancer patients. The advantages of our method include its capability to infer causal GRNs (with directed and signed edges) and its robustness to the measurement variability in the data. Performance evaluation using both simulated data and real data demonstrate that our method outperforms other existing methods in both pseudotime inference and GRN inference. Our method is then applied to reconstruct EMT regulatory networks in bladder cancer and to identify key regulators underlying progression of breast cancer. Importantly, the predicted key regulators/interactions are experimentally validated. Our study suggests that inferring dynamic progression trajectory from static expression data of tumor samples helps to uncover regulatory mechanisms underlying cancer progression and to discovery key regulators which may be used as candidate drug targets. Unraveling molecular regulatory networks underlying disease progression is critically important for understanding disease mechanisms and identifying drug targets. The existing methods for inferring gene regulatory networks (GRNs) rely mainly on time-course gene expression data. However, most available omics data from cross-sectional studies of cancer patients often lack sufficient temporal information, leading to a key challenge for GRN inference. Through quantifying the latent progression using random walks-based manifold distance, we propose a latent-temporal progression-based Bayesian method, PROB, for inferring GRNs from the cross-sectional transcriptomic data of tumor samples. The robustness of PROB to the measurement variabilities in the data is mathematically proved and numerically verified. Performance evaluation on real data indicates that PROB outperforms other methods in both pseudotime inference and GRN inference. Applications to bladder cancer and breast cancer demonstrate that our method is effective to identify key regulators of cancer progression or drug targets. The identified ACSS1 is experimentally validated to promote epithelial-to-mesenchymal transition of bladder cancer cells, and the predicted FOXM1-targets interactions are verified and are predictive of relapse in breast cancer. Our study suggests new effective ways to clinical transcriptomic data modeling for characterizing cancer progression and facilitates the translation of regulatory network-based approaches into precision medicine.
What problem does this paper attempt to address?