iPiDA-GCN: Identification of piRNA-disease associations based on Graph Convolutional Network

Jialu Hou,Hang Wei,Bin Liu
DOI: https://doi.org/10.1371/journal.pcbi.1010671
2022-10-28
PLoS Computational Biology
Abstract:Piwi-interacting RNAs (piRNAs) play a critical role in the progression of various diseases. Accurately identifying the associations between piRNAs and diseases is important for diagnosing and prognosticating diseases. Although some computational methods have been proposed to detect piRNA-disease associations, it is challenging for these methods to effectively capture nonlinear and complex relationships between piRNAs and diseases because of the limited training data and insufficient association representation. With the growth of piRNA-disease association data, it is possible to design a more complex machine learning method to solve this problem. In this study, we propose a computational method called iPiDA-GCN for piRNA-disease association identification based on graph convolutional networks (GCNs). The iPiDA-GCN predictor constructs the graphs based on piRNA sequence information, disease semantic information and known piRNA-disease associations. Two GCNs (Asso-GCN and Sim-GCN) are used to extract the features of both piRNAs and diseases by capturing the association patterns from piRNA-disease interaction network and two similarity networks. GCNs can capture complex network structure information from these networks, and learn discriminative features. Finally, the full connection networks and inner production are utilized as the output module to predict piRNA-disease association scores. Experimental results demonstrate that iPiDA-GCN achieves better performance than the other state-of-the-art methods, benefitted from the discriminative features extracted by Asso-GCN and Sim-GCN. The iPiDA-GCN predictor is able to detect new piRNA-disease associations to reveal the potential pathogenesis at the RNA level. The data and source code are available at http://bliulab.net/iPiDA-GCN/. PiRNAs play critical roles in various biological processes and the abnormal expression of piRNAs may lead to diseases. Meanwhile, several biological experiments show that piRNAs have the potential to be biomarkers or therapeutic targets to diagnose and prognosticate diseases. Some computational methods have been proposed to detect piRNA-disease associations, and provide promising results. However, with the increasing discovery of piRNA-disease associations, the existing methods fail to capture nonlinear and complex association patterns because of the limited training data and insufficient association representation. To overcome above questions, a novel computational method named iPiDA-GCN is proposed for piRNA-disease association identification based on graph convolutional networks. iPiDA-GCN constructs heterogeneous biological networks, and designs Asso-GCN and Sim-GCN modules for learning hidden association patterns in different biological networks. The experimental results show that iPiDA-GCN is able to detect new piRNA-disease associations, and outperforms the other state-of-the-art methods.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?