Drug-disease association prediction using semantic graph and function similarity representation learning over heterogeneous information networks

Bo-Wei Zhao,Xiao-Rui Su,Yue Yang,Dong-Xu Li,Guo-Dong Li,Peng-Wei Hu,Yong-Gang Zhao,Lun Hu
DOI: https://doi.org/10.1016/j.ymeth.2023.10.014
IF: 4.647
2023-12-01
Methods
Abstract:Discovering new indications for existing drugs is a promising development strategy at various stages of drug research and development. However, most of them complete their tasks by constructing a variety of heterogeneous networks without considering available higher-order connectivity patterns in heterogeneous biological information networks, which are believed to be useful for improving the accuracy of new drug discovering. To this end, we propose a computational-based model, called SFRLDDA, for drug-disease association prediction by using semantic graph and function similarity representation learning. Specifically, SFRLDDA first integrates a heterogeneous information network (HIN) by drug-disease, drug-protein, protein-disease associations, and their biological knowledge. Second, different representation learning strategies are applied to obtain the feature representations of drugs and diseases from different perspectives over semantic graph and function similarity graphs constructed, respectively. At last, a Random Forest classifier is incorporated by SFRLDDA to discover potential drug-disease associations (DDAs). Experimental results demonstrate that SFRLDDA yields a best performance when compared with other state-of-the-art models on three benchmark datasets. Moreover, case studies also indicate that the simultaneous consideration of semantic graph and function similarity of drugs and diseases in the HIN allows SFRLDDA to precisely predict DDAs in a more comprehensive manner.
biochemistry & molecular biology,biochemical research methods
What problem does this paper attempt to address?