metabCombiner 2.0: Disparate Multi-Dataset Feature Alignment for LC-MS Metabolomics
Hani Habra,Jennifer L. Meijer,Tong Shen,Oliver Fiehn,David A. Gaul,Facundo M. Fernández,Kaitlin R. Rempfert,Thomas O. Metz,Karen E. Peterson,Charles R. Evans,Alla Karnovsky
DOI: https://doi.org/10.3390/metabo14020125
IF: 4.1
2024-02-16
Metabolites
Abstract:Liquid chromatography–high-resolution mass spectrometry (LC-HRMS), as applied to untargeted metabolomics, enables the simultaneous detection of thousands of small molecules, generating complex datasets. Alignment is a crucial step in data processing pipelines, whereby LC-MS features derived from common ions are assembled into a unified matrix amenable to further analysis. Variability in the analytical factors that influence liquid chromatography separations complicates data alignment. This is prominent when aligning data acquired in different laboratories, generated using non-identical instruments, or between batches from large-scale studies. Previously, we developed metabCombiner for aligning disparately acquired LC-MS metabolomics datasets. Here, we report significant upgrades to metabCombiner that enable the stepwise alignment of multiple untargeted LC-MS metabolomics datasets, facilitating inter-laboratory reproducibility studies. To accomplish this, a "primary" feature list is used as a template for matching compounds in "target" feature lists. We demonstrate this workflow by aligning four lipidomics datasets from core laboratories generated using each institution's in-house LC-MS instrumentation and methods. We also introduce batchCombine, an application of the metabCombiner framework for aligning experiments composed of multiple batches. metabCombiner is available as an R package on Github and Bioconductor, along with a new online version implemented as an R Shiny App.
biochemistry & molecular biology