Metabolomic Strategy Using Proton Nuclear Magnetic Resonance (1H NMR) and Machine Learning to Identify the Geographic Origins of the Seeds of Croton tiglium L.

Didaer Walibieke, Li Manqi, Lu Meng, Liang Xinyue, Rao Li, Li Yan, Xiao Jiaxun, Zhou Kai
a Chongqing Key Laboratory of High Active Traditional Chinese Drug Delivery System, Chongqing Medical and Pharmaceutical College, Chongqing, China
b School of Pharmacy, Chongqing University, Chongqing, China
c Analytical and Testing Center, Chongqing University, Chongqing, China
DOI: https://doi.org/10.1080/00032719.2024.2406442
2024-09-25
Analytical Letters
Abstract: The geographical origin of a traditional Chinese medicine, which covers where the material is grown, collected, and processed, significantly affects its quality and efficacy and is a key indicator for its evaluation and identification. This work used untargeted proton nuclear magnetic resonance (1H NMR) metabolomics together with machine learning techniques to accurately identify the geographical origins of 40 Croton tiglium L. samples from Sichuan, Guangxi, Henan, and Dabie Mountain. Solvents of varying polarity were used for sample pretreatment, and principal component analysis (PCA) suggested that extraction with petroleum ether (PE) best retains metabolite information and effectively distinguishes Croton tiglium L. samples across regions. Eighteen major metabolites, including alkanes, esters, organic acids, and lipids, were identified in the PE extract. The nonlinear random forest algorithm for classifying geographical origin achieved an accuracy of 100% on both the training and test sets. The machine learning algorithms highlighted organic acids and esters as the variables chiefly responsible for this separation. This approach provides an effective method for determining the origin and quality of the seeds of Croton tiglium L., with broad application prospects.
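The workflow summarized in the abstract (PCA for exploratory separation, then a random forest for origin classification and variable ranking) can be sketched roughly as follows. This is a minimal illustration and not the authors' pipeline: the data are synthetic random numbers, the number of spectral bins, the train/test split, and the forest size are assumptions, and scikit-learn is an assumed tool choice that the abstract does not name.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
regions = ["Sichuan", "Guangxi", "Henan", "Dabie Mountain"]
n_per_region, n_bins = 10, 200  # 40 samples as in the abstract; 200 bins is hypothetical

# Synthetic stand-in for binned 1H NMR intensities (NOT real data):
# each region gets elevated intensity in its own small block of bins.
blocks, labels = [], []
for i, region in enumerate(regions):
    block = rng.normal(0.0, 1.0, size=(n_per_region, n_bins))
    block[:, i * 5:(i + 1) * 5] += 4.0
    blocks.append(block)
    labels += [region] * n_per_region
X = np.vstack(blocks)
y = np.array(labels)

# Exploratory step: autoscale the bins, then project onto two principal components.
scores = PCA(n_components=2).fit_transform(StandardScaler().fit_transform(X))

# Classification step: stratified split, then a random forest on the unscaled bins.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)
clf = RandomForestClassifier(n_estimators=200, random_state=0)
clf.fit(X_train, y_train)
test_accuracy = clf.score(X_test, y_test)

# Rank bins by impurity-based importance to see which variables drive the separation.
top_bins = np.argsort(clf.feature_importances_)[::-1][:5]
print("test accuracy:", test_accuracy)
print("most important bins:", top_bins)
```

With real spectra, the important bins would be mapped back to chemical shifts and assigned metabolites. With only 40 samples, a cross-validated estimate would usually be more informative than a single held-out split.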
chemistry, analytical