MNNMDA: Predicting Human Microbe-Disease Association via a Method to Minimize Matrix Nuclear Norm
Haiyan Liu, Pingping Bing, Meijun Zhang, Geng Tian, Jun Ma, Haigang Li, Meihua Bao, Kunhui He, Jianjun He, Binsheng He, Jialiang Yang
DOI: https://doi.org/10.1016/j.csbj.2022.12.053
IF: 6.155
2023-01-04
Computational and Structural Biotechnology Journal
Abstract: Identifying potential associations between microbes and diseases is the first step toward revealing the pathological mechanisms of microbe-associated diseases. However, traditional culture-based microbial experiments are expensive and time-consuming, so it is critical to prioritize disease-associated microbes computationally for further experimental validation. In this study, we propose a novel method called MNNMDA, which predicts microbe-disease associations (MDAs) by applying matrix nuclear norm minimization to known microbe and disease data. Specifically, we first calculated Gaussian interaction profile kernel similarity and functional similarity for diseases and microbes. We then constructed a heterogeneous information network by combining the integrated disease similarity network, the integrated microbe similarity network, and the known microbe-disease bipartite network. Finally, we formulated microbe-disease association prediction as a low-rank matrix completion problem, which we solved by minimizing the nuclear norm of a matrix subject to a few regularization terms. We evaluated MNNMDA on three datasets of small, medium, and large size: HMDAD, Disbiome, and Combined Data, respectively. We also compared MNNMDA with five state-of-the-art methods: KATZHMDA, LRLSHMDA, NTSHMDA, GATMDA, and KGNMDA. Under 5-fold cross-validation for all microbe-disease associations, MNNMDA achieved areas under the ROC curve (AUROC) of 0.9536 on HMDAD and 0.9364 on Disbiome, better than the AUROCs of the compared methods. It also performed relatively well on the Combined Data, with an AUROC of 0.8858. In addition, MNNMDA outperformed the other methods in area under the precision-recall curve (AUPR) under 5-fold cross-validation for all associations, and in both AUROC and AUPR under 5-fold cross-validation for diseases and 5-fold cross-validation for microbes.
Finally, case studies on colon cancer and inflammatory bowel disease (IBD) also validated the effectiveness of MNNMDA. In conclusion, MNNMDA is an effective method for predicting microbe-disease associations.
Availability: The code and data for this paper are freely available on GitHub at https://github.com/Haiyan-Liu666/MNNMDA.
Graphical abstract: In this study, we propose a novel method called MNNMDA, which predicts microbe-disease associations (MDAs) by applying matrix nuclear norm minimization to known microbe and disease data. Specifically, we surveyed the literature and related databases to obtain high-quality known relationships between microbes and diseases. By incorporating multiple similarity networks for microbes and diseases, we formulated microbe-disease association prediction as a low-rank matrix completion problem, which we solved by minimizing the nuclear norm of a matrix subject to a few regularization terms.
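The three steps named in the abstract (Gaussian interaction profile kernel similarity, a heterogeneous network, and nuclear-norm-based completion) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the toy association matrix, the diseases-as-rows orientation, the choice of which entries count as observed, the threshold `tau`, and the plain proximal-gradient solver with singular value thresholding are all assumptions made for illustration. Functional similarity and the paper's specific regularization terms are omitted.

```python
import numpy as np


def gip_kernel(profiles):
    """Gaussian interaction profile kernel between the rows of a 0/1 matrix."""
    sq_norms = (profiles ** 2).sum(axis=1)
    mean_sq = sq_norms.mean()
    # Bandwidth normalised by the mean squared profile norm (a common convention,
    # assumed here rather than taken from the paper).
    gamma = 1.0 / mean_sq if mean_sq > 0 else 1.0
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (profiles @ profiles.T)
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


def build_heterogeneous(assoc, sim_d, sim_m):
    """Stack similarities and associations into [[S_d, A], [A^T, S_m]]."""
    return np.block([[sim_d, assoc], [assoc.T, sim_m]])


def svt(matrix, tau):
    """Singular value thresholding: the proximal operator of tau * nuclear norm."""
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    return (u * np.maximum(s - tau, 0.0)) @ vt


def complete_low_rank(target, mask, tau=0.5, n_iter=200):
    """Proximal gradient for 0.5 * ||mask * (X - target)||_F^2 + tau * ||X||_*."""
    x = np.zeros_like(target, dtype=float)
    for _ in range(n_iter):
        # Unit step size is valid because the masked residual is 1-Lipschitz.
        x = svt(x - mask * (x - target), tau)
    return x


# Hypothetical toy data: 4 diseases (rows) x 3 microbes (columns).
assoc = np.array([[1, 1, 0],
                  [1, 0, 0],
                  [0, 0, 1],
                  [0, 1, 1]], dtype=float)
n_d = assoc.shape[0]

sim_d = gip_kernel(assoc)      # disease-disease similarity
sim_m = gip_kernel(assoc.T)    # microbe-microbe similarity
hetero = build_heterogeneous(assoc, sim_d, sim_m)

# Assumption: similarity blocks and known associations are observed;
# zeros in the association blocks are treated as unknown.
mask = np.ones_like(hetero)
mask[:n_d, n_d:] = assoc
mask[n_d:, :n_d] = assoc.T

completed = complete_low_rank(hetero, mask)
scores = completed[:n_d, n_d:]  # predicted disease-microbe association scores
print(np.round(scores, 3))
```

Larger values in `scores` at positions where `assoc` is zero would be read as candidate associations to prioritize. In practice `tau` and the iteration count would need tuning, for example by the cross-validation schemes the abstract describes.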
biochemistry & molecular biology