Serum metabolic profiling of schizophrenia based on random forest

Yingjun LIU,Tao ZHANG,Lu WANG,Jia LIU,Xuerun CHANG,Jingxuan ZHANG,Fuzhong XUE
DOI: https://doi.org/10.6040/j.issn.1671-7554.0.2014.476
2015-01-01
Abstract:Objective To explore the classification ability of random forest in the serum metabolic profiling of schizo-phrenia patients and healthy controls and to select significant metabolites.Methods The case group consisted of 50 patients with schizophrenia and control group consisted of 62 healthy individuals.The serum samples of case and control groups were collected and detected by RRLC-QTOF/MS platform.Random forest was used to classify the serum metabol-ic data in case and control groups.OOB estimate of error rate and 5 fold cross validation were used to evaluate the classi-fication ability.In addition,variable importance measure of random forest was adopted to select important metabolites. Results Schizophrenia and control serum metabolic data could be classified well using the method of random forest.The misclassification rates in case and control groups were 4.0% and 1.6% respectively,OOB estimate of error rate was 2.68%,and the area under the curve of ROC was 0.99.Furthermore,15 important metabolites were selected according to variable importance measure.Conclusion The combination of liquid chromatography-mass spectrum technology with random forest can select metabolites with potential clinical application value,and be used in the study of metabolomics.
What problem does this paper attempt to address?