Integrating HRMAS-NMR Data and Machine Learning-Assisted Profiling of Metabolite Fluxes to Classify Low- and High-Grade Gliomas

Safia Firdous,Zubair Nawaz,Rizwan Abid,Leo L. Cheng,Syed Ghulam Musharraf,Saima Sadaf
DOI: https://doi.org/10.1007/s12539-024-00642-x
2024-09-28
Interdisciplinary Sciences Computational Life Sciences
Abstract:Diagnosing and classifying central nervous system tumors such as gliomas or glioblastomas pose a significant challenge due to their aggressive and infiltrative nature. However, recent advancements in metabolomics and magnetic resonance spectroscopy (MRS) offer promising avenues for differentiating tumor grades both in vivo and ex vivo. This study aimed to explore tissue-based metabolic signatures to classify/distinguish between low- and high-grade gliomas. Forty-six histologically confirmed, intact solid tumor samples from glioma patients were analyzed using high-resolution magic angle spinning nuclear magnetic resonance (HRMAS-NMR) spectroscopy. By integrating machine learning (ML) algorithms, spectral regions with the most discriminative potential were identified. Validation was performed through univariate and multivariate statistical analyses, along with HRMAS-NMR analyses of 46 paired plasma samples. Amongst the various ML models applied, the logistics regression identified 46 spectral regions capable of sub-classifying gliomas with accuracy 87% (F1-measure 0.87, Precision 0.82, Recall 0.93), whereas the extra-tree classifier identified three spectral regions with predictive accuracy of 91% (F1-measure 0.91, Precision 0.85, Recall 0.97). Wilcoxon test presented 51 spectral regions significantly differentiating low- and high-grade glioma groups ( p < 0.05). Based on sensitivity and area under the curve values, 40 spectral regions corresponding to 18 metabolites were considered as potential biomarkers for tissue-based glioma classification and amongst these N-acetyl aspartate, glutamate, and glutamine emerged as the most important markers. These markers were validated in paired plasma samples, and their absolute concentrations were computed. Our results demonstrate that the metabolic markers identified through the HRMAS-NMR-ML analysis framework, and their associated metabolic networks, hold promise for targeted treatment planning and clinical interventions in the future.
mathematical & computational biology
What problem does this paper attempt to address?