Enrichment analysis for spatial and single-cell metabolomics accounting for molecular ambiguity

Bishoy Wadie,Martijn R Molenaar,Lucas Maciel Vieira,Theodore Alexandrov
DOI: https://doi.org/10.1101/2024.08.23.609355
2024-08-23
Abstract:Imaging mass spectrometry (imaging MS) has advanced spatial and single-cell metabolomics, but the reliance on MS1 data complicates the accurate identification of molecular structures, not being able to resolve isomeric and isobar molecules. This prevents application of conventional methods for overrepresentation analysis (ORA) and metabolite set enrichment analysis (MSEA). To address this, we introduce S2IsoMEr R package and a web app for METASPACE, which uses bootstrapping to propagate isomeric/isobaric ambiguities into the enrichment analysis. We demonstrate S2IsoMEr for single-cell metabolomics and the METASPACE web app for spatial metabolomics.
Bioinformatics
What problem does this paper attempt to address?
The paper primarily addresses the issue of molecular structure identification ambiguity caused by the limitations of mass spectrometry primary data (MS1 data) in spatial metabolomics and single-cell metabolomics. Specifically, while imaging mass spectrometry (imaging MS) technology has advanced the fields of spatial metabolomics and lipidomics and made it possible to obtain metabolomics data at the single-cell level, the lack of targeted tandem mass spectrometry (MS/MS) fragment analysis complicates the precise identification of molecular structures. This ambiguity makes traditional Overrepresentation Analysis (ORA) and Metabolite Set Enrichment Analysis (MSEA) difficult to apply. To address this issue, the research team developed two methods to handle the uncertainty of molecular isomers and isotopologues: 1. **METASPACE**: This is an online platform that guides the uncertainty of molecular isomers and isotopologues into enrichment analysis using bootstrapping for iterative random sampling. 2. **S2IsoMEr R package**: This is an R software package designed specifically for single-cell metabolomics data, which also uses bootstrapping to handle the uncertainty of molecular isomers and isotopologues and supports ranking-based enrichment analysis. Both methods effectively address the issue of molecular identification ambiguity caused by MS1 data, making enrichment analysis more accurate and reliable.