Predicting structural groups of small molecules from 1H NMR spectral features using common machine learning classifiers

Igor d'Anciães Almeida Silva
DOI: https://doi.org/10.26434/chemrxiv-2024-jsvm3
2024-09-20
Abstract: Structural determination of molecules using solution-state nuclear magnetic resonance (NMR) is time-consuming, mostly because of spectral analysis and the correlation of spectral features with structural motifs. A few machine learning methods exist to aid this step of the workflow, but they require at least 1H and 13C chemical shifts to make predictions. In this paper we show that it is possible to predict, with good accuracy (> 0.8), structural groups of small molecules using only 1H NMR spectral features (chemical shift, J-coupling, integral values, and splitting patterns). For this task we employed common machine learning classifiers found in the sklearn Python module, and a database constructed from 1H NMR spectra found in online NMR tools (NMR-Challenge and NMRium).
Chemistry
What problem does this paper attempt to address?
The paper addresses the prediction of structural groups in small molecules from one-dimensional proton nuclear magnetic resonance (^1H NMR) spectral features alone: chemical shifts, J-couplings, integral values, and splitting patterns. The researchers report good accuracy (> 0.8) with this approach, using common machine learning classifiers. The method is intended to simplify and accelerate structure determination by NMR spectroscopy by reducing the need for carbon-13 (^13C) NMR and two-dimensional NMR experiments.
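
To make the four feature types concrete, the following is a minimal sketch of how one ^1H NMR signal could be turned into a numeric vector for a classifier. It is not the paper's encoding, which is not described here: the `Peak` class, the `encode_peak` function, and the `MULTIPLICITIES` vocabulary are hypothetical names chosen for illustration, and the ethanol values are approximate textbook numbers rather than entries from the paper's database.

```python
from dataclasses import dataclass

# Hypothetical vocabulary of splitting patterns (assumption, not from the paper):
# singlet, doublet, triplet, quartet, multiplet.
MULTIPLICITIES = ("s", "d", "t", "q", "m")


@dataclass
class Peak:
    """One 1H NMR signal described by the four feature types."""
    shift_ppm: float    # chemical shift in ppm
    j_hz: float         # J-coupling constant in Hz (0.0 when not resolved)
    integral: float     # relative number of protons
    multiplicity: str   # splitting pattern, one of MULTIPLICITIES


def encode_peak(peak):
    """Return a flat feature vector: three numbers plus a one-hot multiplicity."""
    if peak.multiplicity not in MULTIPLICITIES:
        raise ValueError(f"unknown multiplicity: {peak.multiplicity!r}")
    one_hot = [1.0 if peak.multiplicity == m else 0.0 for m in MULTIPLICITIES]
    return [peak.shift_ppm, peak.j_hz, peak.integral] + one_hot


# Illustrative, approximate values for the two alkyl signals of ethanol.
ethanol_peaks = [
    Peak(shift_ppm=1.2, j_hz=7.0, integral=3.0, multiplicity="t"),  # CH3
    Peak(shift_ppm=3.7, j_hz=7.0, integral=2.0, multiplicity="q"),  # CH2
]
feature_matrix = [encode_peak(p) for p in ethanol_peaks]
```

Vectors of this kind, paired with structural-group labels, could then be passed to a scikit-learn classifier such as `RandomForestClassifier`; which classifiers and which encoding the author actually used is not stated in this summary.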