MultiSourcDSim: an Integrated Approach for Exploring Disease Similarity

Lei Deng,Danyi Ye,Junmin Zhao,Jingpu Zhang
DOI: https://doi.org/10.1186/s12911-019-0968-8
IF: 3.298
2019-01-01
BMC Medical Informatics and Decision Making
Abstract:A collection of disease-associated data contributes to study the association between diseases. Discovering closely related diseases plays a crucial role in revealing their common pathogenic mechanisms. This might further imply treatment that can be appropriated from one disease to another. During the past decades, a number of approaches for calculating disease similarity have been developed. However, most of them are designed to take advantage of single or few data sources, which results in their low accuracy. In this paper, we propose a novel method, called MultiSourcDSim, to calculate disease similarity by integrating multiple data sources, namely, gene-disease associations, GO biological process-disease associations and symptom-disease associations. Firstly, we establish three disease similarity networks according to the three disease-related data sources respectively. Secondly, the representation of each node is obtained by integrating the three small disease similarity networks. In the end, the learned representations are applied to calculate the similarity between diseases. Our approach shows the best performance compared to the other three popular methods. Besides, the similarity network built by MultiSourcDSim suggests that our method can also uncover the latent relationships between diseases. MultiSourcDSim is an efficient approach to predict similarity between diseases.
What problem does this paper attempt to address?