Construct a molecular associations network to systematically understand intermolecular associations in<i>Human</i>cells
Hai-Cheng Yi,Zhu-Hong You,Zhen-Hao Guo
DOI: https://doi.org/10.1101/693051
2019-01-01
Abstract:AbstractA key aim of post-genomic biomedical research is to systematically integrate and model all molecules and their interactions in living cells. Existing research usually only focusing on the associations between individual or very limited type of molecules. But the interactions between molecules shouldn’t be isolated but interconnected and influenced. In this study, we revealed, constructed and analyzed a large-scale molecular association network of multiple biomolecules inhumancells by modeling all associations among lncRNA, miRNA, protein, circRNA, microbe, drug, and disease, in which various associations are interconnected and any type of associations can be predicted. More specifically, we defined the molecular associations network and constructed a molecular associations dataset containing 105546 associations. Then, each node is represented by its attribute feature and network embedding learned by Structural Deep Network Embedding. Moreover, Random Forest is trained to predict any kind of associations. And we compared the features and classifiers under five-fold cross-validation. Our method achieves a remarkable performance on entire molecular associations network with an AUC of 0.9552 and an AUPR of 0.9338. To further evaluate the performance of our method, a case study for predicting lncRNA-protein interactions was executed. The experimental results demonstrate that the systematic insight for understanding the synergistic interactions between various molecules and complex diseases. It is anticipated that this work can bring beneficial inspiration and advance related systems biology and biomedical research.Author SummaryThe interactions between the various biomolecules in the cells should not be isolated, but interconnected and influenced. There have been many valuable studies on the interactions between two individual molecules. Based on a systematic and holistic perspective, we revealed and constructed a large-scale molecular associations network by combining various associations in human living cells, including miRNA-lncRNA association, miRNA-disease association, miRNA-protein interaction, lncRNA-disease association, protein-protein interaction, protein-disease association, drug-protein interaction, drug-disease interaction, and lncRNA-protein interaction. To model and analyze this molecular associations network, we employed the network representation learning model to learn how to represent the node. Each node in the network can be represented by network embedding and its own attribute information. Any node can be classified. And any type of the associations in this network can be predicted, which can be considered as link prediction task. Our work provides a new systematic view and conceptual framework to understand complex diseases and life activities. It is anticipated that our study can advance related biological macromolecules, systems biology and biomedical research, and bring some meaningful inspiration.