Exploring Disease Similarity by Integrating Multiple Data Sources

Lei Deng,Danyi Ye,Junmin Zhao,Jingpu Zhang
DOI: https://doi.org/10.1109/bibm.2018.8621526
2018-01-01
Abstract:A growing collection of disease-associated data contributes to the research of disease similarity. Discovering closely related diseases could be helpful in revealing their common pathogenic mechanisms. This might further suggest treatment that can be appropriated from one disease to another. A number of methods for computing disease similarity have been developed during the past decades. However, most of them are designed to take advantage of single or few data sources, which leads to their low accuracy. In this study, we propose a new method named MultiSourcDSim for computing disease similarity by integrating multiple data sources, namely, gene-disease associations, disease-GO biological process associations and symptom-disease associations. The experimental results show the disease similarity calculated by MultiSourcDSim has a significant correlation with disease classification of Medical Subject Headings. Furthermore, compared with the other three popular methods, MultiSourcDSim achieves the best performance. Its average AUC reaches 0.906. In addition, the disease similarity network constructed by MultiSourcDSim suggests that our method is capable of finding potential associations between diseases.
What problem does this paper attempt to address?