Abstract: Lipids are involved in many vital processes within the cell, and alterations in lipid homeostasis have been associated with various diseases such as cancer and type 2 diabetes. Confidently identifying lipids in samples is a prerequisite for understanding the multiple functions lipids fulfill in health and disease. However, the accurate identification of molecular lipid species from tandem mass spectrometry data is still a key challenge in lipidomics. Most current approaches rely on a custom pipeline to process the measured spectra and match them against an in-house reference spectral library, which hinders the comparability of results. To address this challenge, a transformer model called LipiDetective was developed and trained on a dataset composed of reference spectra measured from lipid standards, spectra from databases, and spectra from published experiments, acquired with both shotgun and liquid chromatography mass spectrometry. LipiDetective demonstrates, for the first time, that artificial neural networks can learn the characteristic lipid fragmentation patterns to automatically and accurately annotate molecular lipid species in tandem mass spectra independently of the experimental setup. Because it generalizes the learned fragmentation patterns, the model can even correctly predict lipid species for which it has never seen a spectrum. Analysis of the integrated gradients reveals that LipiDetective focuses on relevant peaks that can be matched to known fragments and are thus human-interpretable. Therefore, LipiDetective has the potential to be a valuable tool to aid in the lipid identification process and support the comparability of results from different sources. Beyond providing LipiDetective as a "ready-to-use" application, this work primarily offers a deeper understanding of how the model functions and how future deep learning models for lipid identification in mass spectra could be improved.

Deriving Accurate Lipid Classification based on Molecular Formula

Molecular Formula Prediction for Chemical Filtering of 3D OrbiSIMS Datasets

LICAR: An Application for Isotopic Correction of Targeted Lipidomic Data Acquired with Class-Based Chromatographic Separations Using Multiple Reaction Monitoring

MS2Lipid: a lipid subclass prediction program using machine learning and curated tandem mass spectral data

Recommendations for Accurate Lipid Annotation and Semi-absolute Quantification from LC-MS/MS Datasets

Lipid Annotator: Towards Accurate Annotation in Non-Targeted Liquid Chromatography High-Resolution Tandem Mass Spectrometry (LC-HRMS/MS) Lipidomics Using a Rapid and User-Friendly Software

CFM-ID 3.0: Significantly Improved ESI-MS/MS Prediction and Compound Identification

Long Chain Base Profiling with Multiple Reaction Monitoring Mass Spectrometry

LipiDetective - a deep learning model for the identification of molecular lipid species in tandem mass spectra

Resolving Modifications on Sphingoid Base and N-Acyl Chain of Sphingomyelin Lipids in Complex Lipid Extracts

FIDDLE: a deep learning method for chemical formulas prediction from tandem mass spectra

MIST-CF: Chemical formula inference from tandem mass spectra

Lipidomics: Mass Spectrometry Based Untargeted Profiling And False Positives

Machine-learning assisted molecular formula assignment to high-resolution mass spectrometry data of dissolved organic matter

Systematic classification of unknown metabolites using high-resolution fragmentation mass spectra

Annotation of DOM metabolomes with an ultrahigh resolution mass spectrometry molecular formula library

Aza-Prilezhaev Aziridination-Enabled Multidimensional Analysis of Isomeric Lipids via High-Resolution U-Shaped Mobility Analyzer-Mass Spectrometry

IDSL.UFA assigns high confidence molecular formula annotations for untargeted LC/HRMS datasets in metabolomics and exposomics

Achieve broad lipid quantitation using a high-throughput targeted lipidomics method: LC-based approach for lipid class separation and quantitation on high sensitivity SCIEX Triple Quad and QTRAP systems

Cross-Validation of Lipid Structure Assignment Using Orthogonal Ion Activation Modalities on the Same Mass Spectrometer

Lipid analysis and lipidomics by structurally selective ion mobility-mass spectrometry