a deep learning-based framework for secondary metabolite class prediction using fruit and leaf images

Neeraja M Krishnan,Saroj Kumar,Binay Panda
DOI: https://doi.org/10.1101/2024.03.13.584746
2024-03-14
Abstract:Fruits produce a wide variety of secondary metabolites of great economic value. Analytical measurement of secondary metabolites is tedious, time-consuming and expensive. Additionally, metabolite concentration varies greatly from tree to tree, making it difficult to choose trees for fruit collection. The current study tested whether deep learning-based models can be developed using fruit and leaf images alone to predict a metabolite’s concentration class ( or ). We collected fruits and leaves ( ) from neem trees grown in the wild across 0.6 million sq km, imaged those, measured concentration of five metabolites (azadirachtin, deacetyl-salannin, salannin, nimbin and nimbolide) using high-performance liquid chromatography and used those to train deep learning models for metabolite class prediction. The best model out of the seven tested ( , , , _ , _ , , and ) provided a validation score of 0.93 and a test score of 0.88. The sensitivity and specificity of the fruit model alone in the test set were 83.52 ± 6.19 and 82.35 ± 5.96 and 79.40 ± 8.50 and 85.64 ± 6.21, for the and the class, respectively. The sensitivity was further boosted to 92.67± 5.25 for the class and 88.11 ± 9.17 for the class and the specificity to 100% for both classes, using a multi-analyte framework. We incorporated the model in an Android mobile App that uses fruit and leaf images to decide whether to ‘pick’ or ‘not pick’ the fruits from a specific tree based on the metabolite concentration class. Our study provides evidence that images of fruits and leaves alone can predict the concentration class of a secondary metabolite without using extensive analytical laboratory procedures and equipment and makes the process of choosing the right tree for fruit collection easy and free of equipment and additional cost.
Bioinformatics
What problem does this paper attempt to address?