Computational Screening of Umami Tastants Using Deep Learning

Prantar Dutta,Kishore Gajula,Rakesh Gupta,Beena Rai
DOI: https://doi.org/10.26434/chemrxiv-2024-spz16
2024-03-20
Abstract:Umami, a fundamental human taste modality, refers to the savory flavors in meats and broths, often associated with monosodium glutamate and protein richness. With limited knowledge of umami molecules, the food industry seeks efficient approaches for identifying novel tastants. In this study, we have devised a virtual screening pipeline for identifying potential novel umami tastants from molecular databases. We first curated a comprehensive classification dataset containing 439 umami and 428 non-umami molecules. A transformer-based architecture was trained to differentiate between the two classes, achieving the best performance to date. Additionally, we built a neural network model for predicting the potency of umami compounds, the first effort of its kind. These two models, in conjunction with similarity analysis and toxicity screening, form an end-to-end framework for the rational discovery of novel tastants. We finally applied this framework to the FooDB database as an illustrative use case. This study demonstrates the potential of data-driven methods in predicting the taste of molecules from structural and chemical features.
Chemistry
What problem does this paper attempt to address?
This paper mainly discusses the use of deep learning to screen umami taste substances. Umami is one of the basic taste patterns of human beings, which is usually associated with the savory taste in meat and soup and is related to high levels of monosodium glutamate and protein. Due to limited understanding of umami molecules, the food industry needs more effective methods to find new umami substances. In the study, the authors constructed a virtual screening pipeline to identify potential new umami substances from molecular databases. They first created a classification dataset containing 439 umami molecules and 428 non-umami molecules, and trained a model based on Transformer architecture to distinguish between the two classes of molecules, achieving the best performance to date. In addition, they developed a neural network model to predict the intensity of umami compounds, which is the first attempt of its kind. These two models, combined with similarity analysis and toxicity screening, form a complete end-to-end framework for the rational discovery of new umami substances. The paper demonstrates the effectiveness of this framework using the FooDB database as an example application. The study highlights the potential of data-driven approaches in predicting taste based on molecular structure and chemical characteristics, and notes that despite existing models for sweet and bitter tastes, relatively less work has been done in predicting umami taste. In the paper, the authors constructed the largest umami classification dataset to date and used deep learning techniques for molecule classification and intensity prediction, aiming to improve the efficiency and accuracy of compound taste prediction in the food industry.