Multivariate curve resolution-based data fusion approaches applied in ¹H NMR metabolomic analysis of healthy cohorts

Andrés R. Martínez Bilesio, Francesc Puig-Castellví, Romà Tauler, Mariela Sciara, Fabián Fay, Rodolfo M. Rasia, Paula Burdisso, Alejandro G. García-Reiriz
DOI: https://doi.org/10.1016/j.aca.2024.342689
IF: 6.911
2024-05-06
Analytica Chimica Acta
Abstract:
Background: Metabolomics plays a critical role in deciphering metabolic alterations within individuals, demanding the use of sophisticated analytical methodologies to navigate its intricate complexity. While many studies focus on single biofluid types, simultaneous analysis of multiple matrices enhances understanding of complex biological mechanisms. Consequently, the development of data fusion methods enabling multiblock analysis becomes essential for comprehensive insights into metabolic dynamics.
Results: This study introduces a novel guideline for jointly analyzing diverse metabolomic datasets (serum, urine, metadata) with a focus on metabolic differences between groups within a healthy cohort. The guideline presents two fusion strategies, 'Low-Level data fusion' (LLDF) and 'Mid-Level data fusion' (MLDF), employing a sequential application of Multivariate Curve Resolution with Alternating Least Squares (MCR-ALS), linking the outcomes of successive analyses. MCR-ALS is a versatile method for analyzing mixed data, adaptable at various stages of data processing, encompassing resonance integration, data compression, and exploratory analysis. The LLDF and MLDF strategies were applied to ¹H NMR spectral data extracted from urine and serum samples, coupled with biochemical metadata sourced from 145 healthy volunteers.
Significance: Both methodologies effectively integrated and analysed multiblock datasets, unveiling the inherent data structure and variables associated with discernible factors among healthy cohorts. While both approaches successfully detected sex-related differences, the MLDF strategy uniquely revealed components linked to age. By applying this analysis, we aim to enhance the interpretation of intricate biological mechanisms and uncover variations that may not be easily discernible through individual data analysis.
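The difference between the two fusion strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the `mcr_als` function below is a bare-bones alternating least squares loop with non-negativity imposed by clipping, the data are synthetic, and the block sizes (other than the 145 samples), the number of components, and the Frobenius-norm block scaling are all assumptions made for illustration. Low-level fusion is shown as concatenating the scaled raw blocks before a single decomposition; mid-level fusion is shown as decomposing each spectral block first and then fusing the resulting scores with the metadata.

```python
import numpy as np

def mcr_als(D, n_components, n_iter=200, seed=0):
    """Bare-bones MCR-ALS sketch: D ~ C @ S.T with C, S >= 0 (via clipping)."""
    rng = np.random.default_rng(seed)
    S = rng.random((D.shape[1], n_components))  # random initial profiles
    for _ in range(n_iter):
        # Least-squares update of scores, then clip negatives to zero
        C = np.clip(D @ np.linalg.pinv(S.T), 0, None)
        # Least-squares update of profiles, then clip negatives to zero
        S = np.clip((np.linalg.pinv(C) @ D).T, 0, None)
    return C, S

def block_scale(X):
    """Give each block equal total weight (assumed scaling choice)."""
    norm = np.linalg.norm(X)
    return X / norm if norm > 0 else X

def synthetic_block(n_samples, n_vars, k, rng):
    """Non-negative bilinear data plus a little noise (illustrative only)."""
    C = rng.random((n_samples, k))
    S = rng.random((n_vars, k))
    return np.abs(C @ S.T + 0.01 * rng.standard_normal((n_samples, n_vars)))

rng = np.random.default_rng(42)
n = 145                                   # number of volunteers in the abstract
serum = synthetic_block(n, 50, 3, rng)    # hypothetical block sizes
urine = synthetic_block(n, 70, 3, rng)
meta = rng.random((n, 5))

# Low-level fusion: concatenate scaled raw blocks, decompose once
D_ll = np.hstack([block_scale(serum), block_scale(urine), block_scale(meta)])
C_ll, S_ll = mcr_als(D_ll, n_components=3)

# Mid-level fusion: decompose each spectral block, then fuse the scores
C_serum, _ = mcr_als(serum, n_components=3)
C_urine, _ = mcr_als(urine, n_components=3)
D_ml = np.hstack([block_scale(C_serum), block_scale(C_urine), block_scale(meta)])
C_ml, S_ml = mcr_als(D_ml, n_components=3)

print(C_ll.shape, S_ll.shape, C_ml.shape, S_ml.shape)
```

In both cases the rows of the final score matrix correspond to samples, so the components can be inspected against sample-level factors; in the mid-level case the fused matrix is far smaller because each spectral block has already been compressed to its scores.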
chemistry, analytical
What problem does this paper attempt to address?