Species identification through deep learning and geometrical morphology in oaks (Quercus spp.): Pros and cons

Min Qi, Fang K. Du, Fei Guo, Kangquan Yin, Jijun Tang
DOI: https://doi.org/10.1002/ece3.11032
IF: 3.167
2024-02-15
Ecology and Evolution
Abstract: We conducted a comparative analysis to evaluate the accuracy and efficiency of geometric morphometric methods (GMMs) and deep learning in discriminating two closely related oaks, Q. aliena and Q. dentata, using genetic assignment as the a priori classification. We found that deep learning is the most cost‐efficient method in terms of time and cost, whereas GMMs can confirm the leaf shape of admixed individuals and reveal the tendency of leaf contraction and expansion. Moreover, the leaf shape of admixed individuals was close to that of Q. dentata, suggesting that oaks retain high levels of fitness variation, with Q. dentata being more favored by selection in leaf morphological traits.

Plant phenotypic characteristics, especially leaf morphology, are an important indicator for species identification. However, leaf shape can be extraordinarily complex in some species, such as oaks. The great variation in leaf morphology and the difficulty of species identification in oaks have attracted the attention of scientists since Charles Darwin. Recent advances in discrimination technology have provided opportunities to understand leaf morphology variation in oaks. Here, we aimed to compare the accuracy and efficiency of species identification in two closely related deciduous oaks by the geometric morphometric method (GMM) and deep learning, using a preliminary identification based on nuclear simple sequence repeats (nSSRs) as the prior. A total of 538 Asian deciduous oak trees from 16 Q. aliena and 23 Q. dentata populations were first assigned by Bayesian clustering analysis of nSSRs to one of the two species or to the admixed group, and this grouping served as the a priori identification of the trees. We then analyzed the shapes of 2328 leaves from the 538 trees in terms of 13 characters (landmarks) by GMM. Finally, we trained on and classified 2221 scanned leaf images with the Xception architecture using deep learning. The two species can be identified by both GMM and deep learning when genetic analysis is used as the a priori classification. Deep learning is the most cost‐efficient method in terms of time, while GMM can confirm the leaf shape of admixed individuals. These methods provide high classification accuracy, highlight their application in plant classification research, and are ready to be applied to other morphological analyses.
ecology, evolutionary biology
What problem does this paper attempt to address?