The preliminary comparison of four correlation analysis methods for association between microbiota and metabolites

Yijun YOU, Dandan LIANG, Tianlu CHEN
DOI: https://doi.org/10.3969/j.issn.2095-3097.2018.02.008
2018-01-01
Abstract: Objective: High-throughput omics data are massive in size and contain diverse information, and the relationships among their variables are complex. Correlation analysis is one of the effective tools for translational medicine and systems biology studies and helps to extract valid correlation pairs from big data. Microbiome and metabolomics platforms, which provide integrated, system-level measurements, are widely used in association analyses between microbiota and metabolites. Because microbiome data and metabolomics data differ in source, structure, and characteristics, a well-founded choice of correlation method is needed for high-quality cross-omics research. Methods: Four typical correlation analysis methods were selected (two classic methods and two methods designed specifically for compositional data), and their performance was tested and compared using simulated and real datasets. Results: On both simulated and real datasets, the correlation coefficients computed by CCLasso were the smallest, its percentage error was the largest, and it found the fewest correlated pairs. SparCC showed the opposite pattern. Pearson and Spearman fell between CCLasso and SparCC. Conclusion: For correlation analysis between metabolomic and microbiome data, CCLasso is more stringent than the other methods and is prone to false-negative results. SparCC is looser and is prone to false-positive results. The error risks of Pearson and Spearman lie between those of CCLasso and SparCC. Researchers should consider both the aim and the emphasis of their study when selecting a method.
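The two classic methods named in the abstract, and the reason compositional data call for dedicated methods, can be illustrated with a minimal sketch. The code below is not taken from the paper and does not implement SparCC or CCLasso; the function names (`pearson`, `ranks`, `spearman`, `closure`) and the toy data are assumptions made for illustration only. It shows that Spearman captures a monotone but nonlinear relationship that Pearson underestimates, and that converting two independent abundances to relative proportions forces a spurious negative correlation between them.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def closure(parts):
    """Scale non-negative abundances so they sum to 1 (relative abundance)."""
    total = sum(parts)
    return [p / total for p in parts]


# Hypothetical toy data: a metabolite that rises exponentially with a taxon.
taxon = [float(i) for i in range(1, 11)]
metabolite = [math.exp(t) for t in taxon]
r_pearson = pearson(taxon, metabolite)
r_spearman = spearman(taxon, metabolite)

# Two independently drawn absolute abundances per sample (hypothetical).
rng = random.Random(0)
absolute = [(rng.uniform(10, 100), rng.uniform(10, 100)) for _ in range(50)]
relative = [closure(sample) for sample in absolute]
rel_a = [sample[0] for sample in relative]
rel_b = [sample[1] for sample in relative]
r_closed = pearson(rel_a, rel_b)

if __name__ == "__main__":
    print("Pearson  (taxon vs metabolite):", round(r_pearson, 3))
    print("Spearman (taxon vs metabolite):", round(r_spearman, 3))
    print("Pearson between two closed proportions:", round(r_closed, 3))
```

With only two parts, the proportions sum to one, so their correlation is driven to -1 regardless of how the underlying abundances behave. This is an extreme case of the constraint that compositional-data methods are designed to account for.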