Comparison of the performance of multiple whole-genome sequence-based tools for the identification of biovar Thuringiensis

Taejung Chung, Abimel Salazar, Grant Harm, Sophia Johler, Laura M. Carroll, Jasna Kovac
DOI: https://doi.org/10.1101/2024.01.23.575246
2024-01-24
Abstract: The Bacillus cereus sensu stricto (s.s.) species comprises strains of biovar Thuringiensis (Bt) known for their bioinsecticidal activity, as well as strains with foodborne pathogenic potential. Bt strains are identified (i) based on the production of insecticidal crystal proteins, also known as Bt toxins, or (ii) based on the presence of cry, cyt, and vip genes, which encode Bt toxins. Multiple bioinformatics tools have been developed for the detection of crystal protein-encoding genes based on whole-genome sequencing (WGS) data. However, the performance of these tools is yet to be evaluated using phenotypic data. Thus, the goal of this study was to assess the performance of four bioinformatics tools for the detection of crystal protein-encoding genes. The accuracy of sequence-based identification of Bt was determined in reference to phenotypic microscope-based screening for production of crystal proteins. A total of 58 diverse strains isolated from clinical, food, and environmental sources and from commercial biopesticide products underwent WGS. Isolates were examined for crystal protein production using phase contrast microscopy. Crystal protein-encoding genes were detected using BtToxin_Digger, BTyper3, IDOPS, and Cry_processor. Out of 58 isolates, the phenotypic production of crystal proteins was confirmed for 18 isolates. Specificity and sensitivity of Bt identification based on sequences were 0.85 and 0.94 for BtToxin_Digger, 0.97 and 0.89 for BTyper3, 0.95 and 0.94 for IDOPS, and 0.88 and 1.00 for Cry_processor, respectively. Cry_processor predicted crystal protein production with the highest specificity, and BtToxin_Digger and IDOPS predicted crystal protein production with the highest sensitivity. Three out of four tested bioinformatic tools performed well overall, with IDOPS achieving both high sensitivity and specificity (>0.90).
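For readers less familiar with the two metrics, the following is a minimal sketch of how sensitivity and specificity can be computed when a tool's sequence-based calls are compared against the microscopy phenotype. Only the totals (58 isolates, 18 crystal-positive) come from the abstract; the per-tool counts of true and false calls below are hypothetical values chosen for illustration, not figures reported by the authors, and the function names are assumptions.

```python
def sensitivity(tp: int, fn: int) -> float:
    """Fraction of phenotype-positive isolates that the tool also calls positive."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Fraction of phenotype-negative isolates that the tool also calls negative."""
    return tn / (tn + fp)


# Totals taken from the abstract.
n_total = 58
n_pos = 18                 # crystal production confirmed by microscopy
n_neg = n_total - n_pos    # 40 isolates without observed crystals

# Hypothetical confusion counts for one tool (illustrative only).
tp, fn = 17, 1   # phenotype-positive isolates: detected / missed
tn, fp = 38, 2   # phenotype-negative isolates: correctly negative / falsely positive

assert tp + fn == n_pos and tn + fp == n_neg

sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

With only 18 phenotype-positive isolates, a single missed isolate shifts sensitivity by roughly 0.06, which is worth keeping in mind when comparing tools whose reported values differ by a few hundredths.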
Bioinformatics