Phylogenetic Relationships and Next-Generation Barcodes in the Genus Torreya Reveal a High Proportion of Misidentified Cultivated Plants

Zhi-Qiong Mo, Jie Wang, Michael Möller, Jun-Bo Yang, Lian-Ming Gao
DOI: https://doi.org/10.3390/ijms241713216
IF: 5.6
2023-08-26
International Journal of Molecular Sciences
Abstract: Accurate species identification is key to conservation and phylogenetic inference. Living plant collections from botanical gardens and arboreta are important resources for scientific research, but the proportion of misidentified cultivated plants has not been tested using DNA barcodes. Here, we assembled next-generation barcodes (complete plastid genome and complete nrDNA cistron) and mitochondrial genes from genome skimming data of Torreya species, with multiple accessions for each species, to test species discrimination and the proportion of misidentified cultivated plants used in Torreya studies. A total of 38 accessions were included in the analyses, representing all nine recognized species of the genus Torreya. The plastid phylogeny showed that all 21 wild samples formed species-specific clades, except T. jiulongshanensis. Disregarding this putative hybrid, seven recognized species sampled here were successfully discriminated by the plastid genome. Only the T. nucifera accessions grouped into two grades. The species identification rate of the nrDNA cistron was 62.5%. The Skmer analysis based on nuclear reads from genome skims showed promise for species identification, with seven species discriminated. The proportion of misidentified cultivated plants from arboreta/botanical gardens was relatively high, with four accessions (23.5%) representing three species. Interspecific relationships within Torreya were fully resolved with maximum support by plastomes, where Torreya jackii was on the earliest diverging branch, though it was sister to T. grandis in the nrDNA cistron tree, suggesting that it is likely a hybrid species between T. grandis and an extinct Torreya ancestor lineage. The findings here provide quantitative insights into the usage of cultivated samples for phylogenetic study.
biochemistry & molecular biology; chemistry, multidisciplinary
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are:

1. **Evaluating how well next-generation barcodes (complete plastid genomes and the complete nrDNA cistron) and nuclear reads analyzed with Skmer discriminate species in the genus Torreya.** Specifically, the authors test whether these methods can accurately identify and distinguish the species within the genus.
2. **Determining the proportion of misidentified cultivated plants from botanical gardens and arboreta.** Plants in living collections may be misidentified because of label errors or changes in morphological characteristics, which can undermine phylogenetic studies based on these samples.
3. **Inferring the phylogenetic relationships among species within the genus Torreya.** By comparing plastid, nrDNA cistron and mitochondrial gene data, the authors aim to resolve the evolutionary relationships among these species and to detect possible hybridization or introgression.

### Background and Motivation

- **Importance of species identification**: Accurate species identification is crucial for conservation biology and phylogenetic research. When cultivated plant samples are used, misidentification may lead to incorrect inferences of phylogenetic relationships and biogeographical history.
- **Application of next-generation sequencing technologies**: Next-generation sequencing has made it possible to use complete plastid genomes, the nrDNA cistron and nuclear genome data for species identification. These markers carry more variable sites than standard barcodes and are therefore used to distinguish closely related species.
- **Problems with cultivated plant samples**: Botanical gardens and arboreta hold abundant plant samples, but these may be misidentified because of label errors or changes in morphological characteristics. Such errors affect research results based on these samples.

### Research Methods

- **Data collection**: The authors sampled multiple individuals from all recognized species of the genus Torreya, including wild and cultivated plants, for a total of 38 accessions.
- **Genome sequencing**: Plastid genomes, the nrDNA cistron and mitochondrial genes were assembled from genome skimming data.
- **Data analysis**: Phylogenetic trees were constructed with Maximum Likelihood (ML) and Bayesian Inference (BI), and species identification was further tested with Skmer analysis of nuclear reads.

### Main Findings

- **Efficiency of species delimitation**: Disregarding the putative hybrid T. jiulongshanensis, the plastid genome discriminated seven recognized species, with only the T. nucifera accessions grouping into two grades. The nrDNA cistron had a species identification rate of 62.5%, and the Skmer analysis discriminated seven species.
- **Proportion of misidentified cultivated plants**: Four cultivated accessions (23.5%), representing three species, were misidentified, a relatively high proportion.
- **Phylogenetic relationships**: The complete plastid genome data fully resolved the species relationships within Torreya with maximum support. T. jackii is the first-diverging species in the plastid tree, and the clade formed by the North American species (T. taxifolia and T. californica) is sister to the remaining East Asian clades. In the nrDNA cistron tree, T. jackii is instead sister to T. grandis, which suggests that it is likely a hybrid between T. grandis and an extinct Torreya lineage.

### Conclusion

Using next-generation sequencing data and several analysis methods, this study improves the accuracy of species identification in the genus Torreya and quantifies the misidentification of cultivated plant samples, providing a reference for future conservation biology and phylogenetic research.
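
The percentages quoted in the abstract, and the idea of counting a species as "discriminated" when its accessions form a species-specific clade, can be illustrated with a minimal Python sketch. Several things here are assumptions rather than the authors' method: the cultivated count of 17 is inferred as 38 total minus 21 wild accessions, the 5-of-8 split behind 62.5% is inferred from the eight species left after setting aside T. jiulongshanensis, and the toy tree, the accession names and the `identification_rate` helper are hypothetical. The paper's actual criterion may also consider branch support, which this sketch ignores.

```python
# Counts stated in the abstract.
total_accessions = 38
wild_accessions = 21
misidentified_cultivated = 4

# Assumption: every non-wild accession is a cultivated one (38 - 21 = 17).
cultivated_accessions = total_accessions - wild_accessions
misid_rate = misidentified_cultivated / cultivated_accessions
print(f"misidentified cultivated accessions: {misid_rate:.1%}")  # 23.5%

# Assumption: 62.5% corresponds to 5 of 8 species (not stated in the abstract).
print(f"nrDNA identification rate: {5 / 8:.1%}")  # 62.5%


def leaves(node):
    """Return the set of tip names under a node of a nested-tuple tree."""
    if isinstance(node, str):
        return {node}
    tips = set()
    for child in node:
        tips |= leaves(child)
    return tips


def clades(node):
    """Yield the tip set of every node in the tree, tips included."""
    yield frozenset(leaves(node))
    if not isinstance(node, str):
        for child in node:
            yield from clades(child)


def identification_rate(tree, species_of):
    """Fraction of species whose accessions form one exclusive clade."""
    all_clades = set(clades(tree))
    by_species = {}
    for accession, species in species_of.items():
        by_species.setdefault(species, set()).add(accession)
    resolved = sorted(
        sp for sp, accs in by_species.items() if frozenset(accs) in all_clades
    )
    return len(resolved) / len(by_species), resolved


# Hypothetical rooted tree: species a and b are each monophyletic,
# while the accessions of c and d are intermixed.
toy_tree = ((("a1", "a2"), ("b1", "b2")), (("c1", "d1"), ("c2", "d2")))
toy_labels = {
    "a1": "sp_a", "a2": "sp_a", "b1": "sp_b", "b2": "sp_b",
    "c1": "sp_c", "c2": "sp_c", "d1": "sp_d", "d2": "sp_d",
}
rate, resolved = identification_rate(toy_tree, toy_labels)
print(rate, resolved)  # 0.5 ['sp_a', 'sp_b']
```

On real data, the tree would come from the ML or BI analysis and the labels from the accession records; a cultivated accession that falls inside another species' clade is a candidate misidentification.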