Protein-protein Interaction Network with Machine Learning Models and Multiomics Data Reveal Potential Neurodegenerative Disease-Related Proteins

Xinjian Yu,Siqi Lai,Hongjun Chen,Ming Chen
DOI: https://doi.org/10.1093/hmg/ddaa065
IF: 5.1214
2020-01-01
Human Molecular Genetics
Abstract:Research of protein-protein interaction in several model organisms is accumulating since the development of high-throughput experimental technologies and computational methods. The protein-protein interaction network (PPIN) is able to examine biological processes in a systematic manner and has already been used to predict potential disease-related proteins or drug targets. Based on the topological characteristics of the PPIN, we investigated the application of the random forest classification algorithm to predict proteins that may cause neurodegenerative disease, a set of pathological changes featured by protein malfunction. By integrating multiomics data, we further showed the validity of our machine learning model and narrowed down the prediction results to several hub proteins that play essential roles in the PPIN. The novel insights into neurodegeneration pathogenesis brought by this computational study can indicate promising directions for future experimental research.
What problem does this paper attempt to address?