An Efficient Approach for Obtaining Small and Macro-molecular 1H NMR Spectra Based on Neural Network

xiongjie xiao,Qianqian Wang,Xin Chai,Xu Zhang,Bin Jiang,Maili Liu
DOI: https://doi.org/10.26434/chemrxiv-2024-nh9mx
2024-01-23
Abstract:Metabolomics plays a vital role in comprehending cellular and organismal metabolic processes. In NMR-based metabolomics studies, specific NMR pulse sequences such as the standard 1D nuclear Overhauser effect spectroscopy (NOESY), 1D Carr-Purcell-Meiboom-Gill (CPMG), and 1D diffusion-edited sequences are commonly utilized to detect distinctive NMR characteristics of small molecule and macromolecule metabolites in plasma or serum samples. However, conducting NMR experiments on multiple samples in metabolomics can be time-consuming. This study introduces the Spectrum-Edited Neural Network (SENNet) for efficient and accurate separation of spectral signals from both macromolecules and small molecules in 1H NMR spectra. The proposed model provides an end-to-end mapping of the entire metabolome NMR spectrum to the macromolecular and small molecule NMR spectra. To validate and optimize the model's hyperparameters, we employed a total of 113 serum samples. Furthermore, the SENNet method was applied to post-process 1D NOESY-presat spectra obtained from 120 plasma samples and 463 serum samples, which were then compared with the corresponding 1D CPMG spectra and 1D diffusion-edited spectra. Our results demonstrate the effective extraction of small molecule signals using the proposed method, as confirmed by comparison with experimental spectra. Principal component analysis (PCA) performed on the macromolecule and small molecule signals reveals comparable statistical information to analyses conducted using experimental data, indicating the efficiency of the SENNet method for signal extraction. This high-throughput NMR post-processing method holds substantial potential for metabolomics research. Additionally, the SENNet method serves as a valuable reference for separating signals from both macro and small molecules in NMR samples.
Chemistry
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to efficiently and accurately separate the signals of small molecules and large molecules from 1H NMR (nuclear magnetic resonance) spectra in metabolomics research. Traditional NMR experimental methods are time - consuming when dealing with multiple samples, and it is difficult to reliably measure peak intensity or integrate NMR signals due to signal overlap. In particular, lipoproteins in plasma or serum samples will produce broad peaks in NMR spectra, and these broad peaks overlap with the sharp peaks of proteins such as albumin and small - molecule metabolites, increasing the difficulty of analysis. In addition, traditional sample pretreatment techniques such as ultrafiltration or precipitation methods can partially solve this problem, but the process is time - consuming and may lead to the loss of large - molecule information. To solve these problems, this study proposes a neural - network - based method - Spectrum - Edited Neural Network (SENNet) for efficiently and accurately separating the signals of large molecules and small molecules from 1H NMR spectra. SENNet edits spectral peaks by linewidth at half - height, thereby achieving signal separation. This method can not only improve the efficiency of NMR post - processing, but also accurately extract small - molecule signals without losing large - molecule information, providing new tools and technical support for metabolomics research.